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IT— < 




ORF1475 polypeptide sequence 
SEQ ID KO:2950. 






1579 


gi 143279 
15 


jHomo sapiens 


clone MGC:111S6 
IMAGE:3844322, mRNA 7 
complete cds. 


3446 


100 


1579 


gil40431 
03 


^Homo sapiens 


clone MGC: 15388 
IMAGE:3350378, mRNA, 
complete cds. 


3446 


100 


1579 


AAB947 
22 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO; 15741. 


3439 


99 


1580 


gtl34461 
90 


Homo sapiens 


Human DNA sequence from 
cloneRP4-7I7M23 on 
chromosome 20 Contains the 
gene encoding a CRP2 binding 
protein (CRP2BP), a 
pseudogene, ESTs, STSs, GSSs 
and CpG islands, complete 
sequence. 


3955 


97 


1580 


gil40431 
03 


Homo sapiens 


clone MGC: 15388 
IMAGE:3350378 ? mRNA, 
complete cds. 


3952 


97 


1580 


gil 43279 
15 


Homo sapiens 


clone MGC:lll86 
IMAGE:3844322 J mRNA, 
complete cds. 


3281 


96 


1581 


gil65516 
10 


Homo sapiens 


cDNA FLJ31697fis, clone 
NT2RI2005851, weakly similar 
to PLECTIN, 


1911 


99 


1581 


AAM664 
35 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ID NO: 26741. 


588 


100 


1581 


AAM540 
44 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded 
protein SEQ ID NO: 26149. 


588 


100 


1582 


AAU204 
43 


Homo sapiens 


HUMA- Human secreted 
protein, Seq ID No 435. 


940 


94 


1582 


AAM937 
13 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3654. 


756 


100 


1582 


gil71490 
39 


Homo sapiens 


MTOI-Hke protein gene, 
complete cds; nuclear gene for 
mitochondrial product. 


756 


100 


1583 


ABB1222 
0 


Homo sapiens 


HYSE- Human peroxisomal Ca- 
dependent solute carrier 
homologue, SEQ:2590. 


344 


100 


1583 


gil28536 
85 


Mus rnusculus 


putative 


168 


52 


1583 


AAM800 
61 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3707. 


165 


55 , 


1584 


gil 43491 
25 


Homo sapiens 


rnRNA for alpha2- 
glucosyltransfci ase ( ALG 1 0 
gene). 


716 


93 


1584 


gi351345 
1 


Rattes 
norvegicus 


potassium channel regulator 1 


681 


90 


1584 


AAB257 
15 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 3 
SEQ ID NO: 104. 


617 


93 


1585 


gil26979 
39 


Homo sapiens 


mRNA for KJAAI697 protein, 
partial cds. 


1904 


100 



206 



WO 02/070539 



PCT/1JS02/05095 



SEQ 
ID 


Hit ID 


Speicies 


Description 


S score 


Percent 
identity 


1585 


gi 1 04402 
49 


Homo sapiens 


cDNA: FLJ23529 fis, clone 
LNG06042. 


1897 


99 


1585 


gi72934l 
5 


Drosophila 
meianogaster 


Dhcl6F gene product 


786 


44 


1586 


AAZ3583 
4_aal 


Homo sapiens 


1NCY- Human vesicle 
trafficking protein 2 encoding 
cDNA. 


804 


84 


15S6 


AAB936 
64 


Homo sapiens 


HELI- Human protein sequence 
SEQ IDNO:13188. 


804 


84 


1586 


AAY499 
59 


Homo sapiens 


INCY- Human vesicle 
trafficking protein 2, 


804 


84 


1587 


AAY765 
61 


Homo sapiens 


META- Human ovarian tumor 
EST fragment encoded protein 
57. 


623 


92 


1587 


gi 136235 
85 


Homo sapiens 


Similar to RIKENcDNA 
1500034E06gene, clone 
MGC:14151 IMAGE :3690202, 
mRNA, complete cds. 


623 


92 


1587 


gil28586 
76 


Mus musculus 


putative 


595 


88 


1588 


AAU206 
47 


Homo sapiens 


HUMA- Human secreted 
protein, Seq ID No 639* 


927 


99 


1588 


AAU205 
23 


Homo sapiens 


HUM A- Human secreted 
protein ? Seq ID No 515. 


927 


99 


1588 


AAB954 
32 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17854. 


927 


99 


1589 


AAB591 
91 


Homo sapiens 


UYCO Human NADE. 


293 


57 


1589 


gi845289 
4 


Homo sapiens 


p75NTR~associated cell death 
executor (NADE) mRNA, 
complete cds. 


293 


57 


1589 


gi 189379 


Homo sapiens 


Human unknown protein from 
clone pHGR74 mRNA, 
complete cds. 


293 


57 


1590 


AAG017 
16 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5797. 


894 


99 


1590 


AAY824 

73 


Homo sapiens 


LLTL- Human APG12 protein 
sequence. 


724 


100 


1590 


gi4U573 
1 


Homo sapiens 


mRNA for Apgl2, complete cds. 


724 


100 


1591 


AAW877 
01 


Homo sapiens 


INCY- A human membrane 
fusion protein designated 
SYTAX1. 


1357 


99 


1591 


gi420024 
1 


Homo sapiens 


H.sapiens gene from PAC 42616, 
similar to syntaxin 7. 


1325 


100 


1591 


gil47150 
19 


Mus musculus 


Unknown (protein for 
MGC:6471) 


1280 


93 - 


1592 


AAA540 
89 aal 


Homo sapiens 


GETH PR021 1 cDNA. 


1944 


87 


1592 


AAB530 
75 


Homo sapiens 


GETH Human angiogenesis- 
associated protein PR0211, SEQ 
IDNO:57. 


1944 


87 


1592 


AAB612 
31 


Homo sapiens 


MILL- Human TANGO 331 
protein. 


1944 


87 


1593 


gi668159 
2 


Homo sapiens 


HS J2 mRNA for DnaJ homolog, 
complete cds. 


1567 


93 


1593 


gil28032 


Homo sapiens 


MRJ gene for a member of the 


1567 


93 
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63 




DNAJ protein family, clone 
MGC:1 152 IMAGE :33 46070, 
mRNA, complete cds. 






1593 


AAW940 
66 


Homo sapiens 


INC Y- Human DnaJ-like 
protein, HSPJ2. 


1516 


98 


1594 


gil 12308 

58 


Homo sapiens 


mRNA for HMG-box 
transcription factor TCF-3, 
complete cds* 


3096 


100 


1594 


gi3 12363 
8 


Mus musculus 


TCF-3 protein 


2934 


95 


1594 


gil42799 
82 


Xenopus laevis 


T-ceil factor XTCF-3 


2195 


77 


1595 


gil63070 
74 


Homo sapiens 


hypothetical protein FLJ22724, 
clone MGC: 16791 
IMAGE:3900548, mRNA, 
complete cds. 


932 


100 


1595 


gi 104392 

25 


Homo sapiens 


cDNA: FLJ22724 fis, clone 
HSI14868. 


932 


100 


1595 


gil28543 
96 


Mus musculus 


putative 


618 


68 


1596 


gi!00472 
29 


Homo sapiens 


mRNA for KIAA1577 protein, 
partial cds. 


3874 


100 


1596 


gi795928 
3 


Homo sapiens 


mRNA forKIAA1511 protein, 
partial cds. 


3002 


74 


1596 


gi 104404 
18 


Homo sapiens 


mRNA for FLJ00044 protein, 
partial cds T 


2010 


63 


1597 


AAY131 
17 


Homo sapiens 


GEST Human secreted protein 
encoded by 5' EST SEQ ID NO: 
131. 


268 


100 


1597 


gil38164 
07 


Sulfolobus 
solfataricus 


Dehydrogenase, putative 


66 


43 


1597 


gil5l400 
82 


Sinorhizobium 
meliloti 


HYPOTHETICAL PROTEIN 


66 


37 


1598 


gil69437 
20 


Homo sapiens 


mRNA for SOX7 protein. 


2106 


100 


1598 


gll65503 
14 


Homo sapiens 


cDNA FLJ30994 fis, clone 
HLUNG1 000076, highly similar 
to Mus musculus mRNA for 
mSox7. 


2106 


100 


1598 


gil3279l 
64 


Homo sapiens 


Similar to SRY-box containing 
gene 7, clone MGC: 10895 
IMAGE:3622936, mRNA, 
complete cds. 


2106 


100 


1599 


gil76461 
46 


Homo sapiens 


B lymphocyte activation-related 
protein mRNA, complete cds. 


1111 


97 


1599 


gil 20062 
23 


Homo sapiens 


NPD017 mRNA, complete cds. 


1111 


97- 


1599 


giI5I267 
45 


Homo sapiens 


hypothetical protein FLJ23 174, 
clone MGC:5372 
IMAGE:3445403, mRNA, 
complete cds. 


1105 


97 


1600 


gil43288 
79 


Homo sapiens 


CUB domain containing protein 
1 (CDCP1) mRNA, complete 
cds. 


4394 


100 


1600 


gil0439S 
15 


Homo sapiens 


cDNA: FLJ22969 fis, clone 
KAT10759. 


4385 


99 


1600 


AAY914 


Homo sapiens 


HUMA- Human secreted protein 


3633 


98 1 
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56 




sequence encoded by gene 6 T 
SEQIDN0:129. k 




1601 


gil20528 
46 


Homo sapiens 


mRNA; cDNA 

DKFZp564K2464 (from clone 
DKFZp564K2464); complete 
cds. 


r 2 
i 

D 


100 


1601 


gi 126527 
23 


Homo sapiens 


clone MGC:3295 
IMAGE;3508204, mRNA, 
comnlete cds 

VV/liiil/lvVv Wit? » 


2035 


100 


1601 


AAW790 
88 


Homo sapiens 


GEMY Human secreted protein 
oil 29 2, 


1115 


100 


1602 


gi50816I 
0 


Mus musculus 


huntington yeast partner C 


4295 


94 


1602 


gi 156368 
98 


Gallus gallus 


formin binding protein 11- 
related protein 


2507 


55 


1602 


gi508160 
8 


Mus musculus 


■fivrtnin tiiti/linu TvrrvtpSn 1 1 
lUUtllll UllLUUlg yiuiciii 1 J 




54 


1603 


gi508l61 
0 


Mus musculus 


iiuiit-iiigivji ycobt paruici 




90 


1603 


gi680803 
8 


Homo sapiens 


mRNA; cDNA 

f)K , P7n/i , ?dH9 191 ffrnm riant* 

DKFZp434H2121); partial cds. 


2341 


100 


1603 


gil56368 
98 


Gallus gallus 


frtrtYiifi ntnmnn rvrttt^tn 1 1 _ 
1U1I11IU UlUUlllg piUlClfl i 1" 

related protein 




52 


1604 


AAB950 
53 


Homo sapiens 


HEU- Human protein sequence 


680 


100 


1604 


gi 104335 
25 


Homo sapiens 


cDNA FLJ12122fis, clone 
MAMMA1000129. 


680 


100 


1604 


AAE0609 
6 


Homo sapiens 


HUMA" Human gene 56 

pnnnrl&A QPrr&t&ri nrnfi^in 
tiivvuciu acwcicu yiuian 

HRABA80, SEQ ID NO: 1 58. 


151 


48 


1605 


AAB943 
09 


Homo sapiens 


HELI- Human protein sequence 
SEQIDNO:14777. 


2318 


99 


1605 


gil04345 
01 


Homo sapiens 


NT2RP2002079, weakly similar 
to HISTONE HI, GONADAL. 




99 


1605 


gi 17391 2 
25 


Mus musculus 


Similar to hypothetical protein 
FLJ12800 


1515 


70 


1606 


AAG645 
02 


Homo sapiens 


UYFU- Human EPIP2 
(endometrial progesterone- 
induced protein). 


1914 


100 


1606 


gi 173902 
89 


Homo sapiens 


Similar to phosphoserine 
aminotransfei^scj clone 
MGC;9290 IMAGE:3883843, 
mRNA, complete cds. 


1914 


100 


1606 


gil34360 
74 


Homo sapiens 


Similar to phosphoserine 
aminotransferase, clone 
MGC:10519 IMAGE:3938160 3 
mRNA, complete cds. 


1914 


100 


1607 


gi532680 
2 


Homo sapiens 


phosphoserint; aminotransferase 
(PSA) mRNA, complete cds. 


1673 


100 


1607 


AAG645 
02 


Homo sapiens 


UYFU- Human EP1P2 
(endometrial progesterone- 
induced protein). 


1616 


87 


1607 


gil73902 
89 


Homo sapiens 


Similar to phosphoserine 
aminotransfej*ase> clone 
MGC:9290 IMAGE:3883843, 


1616 • 


87 
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mRNA, complete cds. 






1608 


gil 34477 
61 


Homo sapiens 


cystatin and DUF19 domain- 
containinAprotein 1 
(CSDUFm) mRNA, complete 
cds. » 


736 


100 


1608 


AAG745 
13 


Homo sapiens 


WMA- Human colon cancer 
antigen protein SEQ ID 
NO;5277. 


570 


100 


1608 


AAB937 
98 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 13564. 


281 


47 


1609 


gil04402 
30 


Homo sapiens 


cDNA: FLJ23514fis, clone 
LNG04628. 


2005 


100 


1609 


gil28529 
73 


Musmusculus 


putative 


1509 


69 


1609 


gil24077 
49 


Arabidopsis 
thaliana 


initiation factor 3a 


138 


22 


1610 


AAY649 
94 


Homo sapiens 


GEST Human 5* EST related 
polypeptide SEQ ID NO: 1155. 


372 


100 


1610 


AAM008 
52 


Homo sapiens 


HYSE- Human bone marrow 
protein* SEQ ID NO: 215. 


69 


39 


1610 


gi332565 


Orf virus 


ORF2 


68 


42 


1611 


gil04397 
05 


Homo sapiens 


cDNA: FLJ23121 fis, clone 
LNG07996. 


3137 


100 


1611 


AAR949 
96 




HRT.T- Hiimnn r*rnt"fMn *!Piiiif*titf > P 

SEQ ID NO: 16625. 


161ft 


99 


161 1 


57 




cDNA FLT1 1 8K9 fte clone 
HEMBA100725 1, weakly 
similar to Homo sapiens F-box 
protein FBX29 (FBX29) mRNA. 


1618 


99 


1612 


AAB952 
34 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17375, 


3584 


100 


1612 


gil 04346 
74 


Homo sapiens 


cDNA FLJ12911 fis, clone 
NT2RP2004425, highly similar 
to Mus musculus axotrophin 
mRNA, 


3584 


100 


1612 


gi505203 
1 


Mus musculus 


axotrophin 


2983 


85 


1613 


gi965095 
4 


Mus musculus 


beta-l,6-N- 

acetylglucosaminyltransferase B 


1254 


73 


1613 


AAV160 
OOjral 


Homo sapiens 


LJOL- Human beta-I,6-N- 
acetylglucosaminyltransferase 
(IGnT) encoding cDNA. 


1044 


65 


1613 


AAQ892 
01 aal 


Homo sapiens 


LJOL- 1- branching enzyme 
cDNA. 


1044 


65 


1614 


gil 00473 
11 


Homo sapiens 


mRNA for KIAA1617 protein, 
partial cds. 


4792 


100 


1614 


gi663535 
3 


Homo sapiens 


RUl (RUl)rnRNA, complete 
cds. 


2467 


55 


1614 


gil 57790 
95 


Homo sapiens 


Similar to RUl, clone 
MGC:3342 IMAGE:3029598, 
mRNA, complete cds. 


2467 


55 


1615 


gi46S069 
3 


Homo sapiens 


CGI-27 protein mRNA, 
complete cds. 


1414 


92 


1615 


gii75H7 
62 


Homo sapiens 


CGI-27 protein, clone 
MGC;31852 IMAGE:4851517, 
mRNA, complete cds. 


1414 


92 


1615 


gil 70463 


Homo sapiens 


C21orfl9-like protein mRNA, 


1414 


92 
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03 


-% 


complete cds. 






1616 


gi 100472 
69 


Hfimo sapiens 


mRNA for KIAAI597 protein, 
partial cds* 


4638 


99 


1616 


gil 36470 
69 


lVws musculus 


synaptotagm in-like protein 2-a 
delta 2S-III 


3717 


81 


1616 


gil 36470 
09 


Mus musculus 


synaptotagmin-like protein 2-a 


3666 


77 


1617 


AAM937 
72 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3778. 


1945 


98 


1617 


ABR1 17^ 

1 




j.1 1 on- riuiiiau grauupmiin-a 
homologue, SEQ ID NO;2 1 01 * 


1 0/LS 

1 7*tJ 




1617 


gil 59302 

1 o 


Homo sapiens 


synaptotagmin-like 2, clone 

IVILIv^.Vjoo livlrVVJC.JOo ID /U, 

mRNA, complete cds. 


1945 


98 


1618 


9 




gcriuniic l/inAj cnroiiiosome 
21q, section 89/105. 


77/1*7 


IUU 


1618 


81 


ft /J I1C TYt 11C/M1 III C 


pUlcLUVC 




HQ 


1618 


gi488438 

A 
\j 


Homo sapiens 


mRNA; cDNA 

UJvrZL.pjoOrU'ti^ ^ironi cione 
DKFZp586F0422); partial cds. 


870 


80 


1619 

JU17 


A AR99S 

46 


nuiiiu sapiens 


lnjcmjjv- iruntdieu L/opamine ui 
receptor encoded by pseudogene 

rifin** m „3Q 
UIUIIC vJL"J7. 


113/ 


o*i 


1619 


AAR210 
82 


Homo sapiens 


NEUR- Dopamine Dl receptor 


1028 


85 


1619 


gi32049 


Homo sapiens 


Human HD5DR gene for D5 
dopamine receptor. 


1028 


85 


1620 


AAEI032 
9 


Homo sapiens 


INCY- Human transporter and 
protein. 


1339 


92 


1620 


AAM258 
77 


Homo sapiens 


HYSE- Human protein sequence 


1284 


100 


1620 


gil28565 
98 


Mus musculus 


putative 


382 


64 


1621 


78 


l-Tnmn ^&ni/*n^ 


SEQ ID NO: 14707. 




t f\f\ 

IUU 


1621 


48 


f-Tnmrt r cnnipn f i 


lljLIXJlllXJlUjtll CII ULCIII 1 Lifl 1 £. f yJU, 

clone MGC:469l 
IMAGE:3533384j mRNA, 
complete cds. 


ktQD 


tf>n 

LUU 


1621 


giI04344 
28 


Homo sapiens 


cDNA FLJ 12750 fis, clone 
NT2RP2001 168, weakly similar 
to VERPROLIN. 


1465 


1 00 


1622 


AAY026 
69 


Homo sapiens 


HUM A- Human secreted protein 
encoded by gene 20 clone 
HMKAH10. 


288 


100 


1622 


gi751042 
1 


Caenorhabditis 
elegans 


hypothetical protein Y6G8. 1 - 
Caenorhabditis elegans > 


66 


37 


1623 


AAB953 
93 


Homo sapiens 
* 


HELI- Human protein sequence 
SEQ ID NO: 17745. 


795 


100 


1623 


gil04352 
17 


Homo sapiens 


cDNA FLJ13265 fis, clone 
OVARC1000937. 


795 


100 


1623 


AAB570 
19 


Homo sapiens 


ROSE/ Human prostate cancer 
antigen protein sequence SEQ 
ID NO: 1597. 


275 


9l 


1624 


AAG892 


Homo sapiens 


GEST Human secreted protein, 


936 


100 
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8i 




SEQ ID NO: 401. 






1624 


AAIH59 
32 


Homo sapiens 


HUM A- Human novel secreted 
protein, Seq ID 885. 


936 


100 


1624 


gil04391 
93 


Homo sapiens 


cDNA: FLJ22700 fis, clone 
HSI12073. 


936 


100 


1625 


gil28579 
64 


Mus museums 


putative 


1533 


86 


1625 


gil 0241 3 
97 


Homo sapiens 


Human DNA sequence from 
clone RP3-336K20 on 
chromosome 6 Contains parts of 
2 genes for novel proteins^ ESTs^ 
STSs and GSSs> complete 
sequence. 


964 


100 


1625 


gil65523 
03 


Homo sapiens 


cDNA FLJ32234 fis ) clone 
PLACE6004687. 


721 


97 


1626 


gil 04381 
58 


Homo sapiens 


cDNA: FLJ21940 fis, clone 
HEP04512* 


3307 


99 


1626 


AAG737 
12 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:4476. 


455 


98 


1626 


gil 04406 
14 


Oryza sativa 


putative ATP-dependent RMA 
helicase 


452 


32 


1627 


gi!04378 

^7 

j i 


Homo sapiens 


cDNA:FLJ21687fis, clone 


1466 


100 


1627 


gi618017 
g 


Homo sapiens 


transcription factor IGHM 
c>iiiidnLer javii i protein, jivi^r 
protein, JM5 protein, T54 

pivlf^tllf JiViHJ piCflClfj, rYH 

differentiation-dependent 
Drotein trinie I JM domain 
protein 6, and synaptophysin 
genes, complete cds; and L-iype 
calcium channel alpha- 1 subunit 
gene, partial cds^ complete 
sequence* 


1182 


98 


1627 


ABB1156 
1 


Homo sapiens 


HYSE- Human JM10 protein 
homologue, SEQ IDNO:1931. 


947 


100 


1628 


AAM252 
27 


Homo sapiens 


HYSE- Human protein sequence 
SEQ IDNO:742. 


2069 


100 


1628 


AAB853 
62 


Homo sapiens 


1NCY- Human phosphatase (PP) 
(clone ID 2522707CD1). 


2021 


100 


1628 


gil 50805 
05 


Homo sapiens 


Similar to RIKENcDNA 
5730568A12 gene, clone 
MGC:1765I IMAGE:3857480, 
mRNA, complete cds. 


1907 


100 


1629 


gi656284 
5 


Rattus 
norvegicus 


type A/B hnRNP p40 


1661 


91 - 


1629 


gi337727 
9 


Rattus 
norvegicus 


A1F-C1 


1654 


91 


1629 


gi!81427 
4 


Homo sapiens 


Human apobec-1 binding protein 
1 mRNA, complete cds. 


1631 


92 


1630 


AAY360 
S3 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID N(X 
468. 


430 


98 


1630 


AAG005 
83 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4664. 


414 


100 


1630 


£1358259 


Cnernidophorus 


NADH dehydrogenase subunit 4 


66 


37 
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4 


tigris 








1631 


AAB102 
84 


Homo sapiens 


GEMY Human fetal placenta 
protein fragment AC175_2i. 


852 


94 


1631 


gi398346 
3 


Homo sapiens 


microfibril-associated 
glycoprotein 2 (MAGP2) gene, 
exon 10 and complete cds. 


852 


94 


1631 


gil35434 
86 


Homo sapiens 


Microfibril-associated 
glycoprotein-2, clone 
MGC:14490 IMAGE:4247343, 
mRNA, complete cds. 


852 


94 


1632 


gil61984 
56 


Homo sapiens 


Similar to RIKENcDNA 
0610040E02gene, clone 
MGC:17973 IMAGE:3919892, 
mRNA, complete cds. 


1339 


99 


1632 


AAY026 
61 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 12 clone 
HFTCU19. 


1142 


99 


1632 


gil 67406 
89 


Mus musculus 


RIKEN cDNA 0610040E02 
gene 


1059 


77 


1633 


AAB530 
94 


Homo sapiens 


GETH Human angiogenesis- 
associated protein PR0826, SEQ 
ID NO: 158. 


510 


100 


1633 


AAB509 
16 


Homo sapiens 


GETH Human PR0826 protein. 


510 


100 


1633 


AAB652 
04 


Homo sapiens 


GETH Human PR0826 
(UNQ467) protein sequence 
SEQ ID NO:20L 


510 


100 


1634 


AAB530 
94 


Homo sapiens 


GETH Human angiogcncsis- 
associated protein PR0826> SEQ 
ID NO: 158. 


413 


85 


1634 


AAB509 
16 


Homo sapiens 


GETH Human PR0826 protein. 


413 


85 


1634 


AAB652 
04 


Homo sapiens 


GETH Human PR0826 
(UNQ467) protein sequence 
SEQ IDNO:201. 


413 


85 


1635 


gil04390 
08 


Homo sapiens 


cDNA: FLJ22573fis, clone 
HSI02387. 


1937 


100 


1635 


AAM878 
76 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO:15469. 


187- 


45 


1635 


gi 133464 
3 


Xenopus laevis 


APEG precursor protein 


93 


37 


1636 


gi 104390 
08 


Homo sapiens 


cDNA: FLJ22573 fis, clone 
HSI02387. 


578 


100 


1636 


AAM878 
76 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQ ID NO: 15469, 


155 


94 


1636 


gi394136 
5 


Homo sapiens 


I-REL gene, exon 12 and 
complete cds. 


79 


37 


1637 


AAM938 
71 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3980. 


3761 


99 


1637 


gi!433Il 
31 


Homo sapiens 


semderin mRNA, complete cds. 


3749 


99 


1637 


AAR804 
81 


Homo sapiens 


NAKA/ Recombinant human 
adseverin. 


3527 


92 


1638 


gil 04369 
70 


Homo sapiens 


cDNA: FLJ20991 fis, clone 
CAE02103. 


239 


67 



213 



WO 02/070539 



PCT/US02/0509S 



SEQ 
ID 


Hit ID 


Spcicies 


Description 


S score 


Percent 
identity 


1638 


gi646024 
0 


Deinococcus 
radiodurans 


DNA-binding response regula&r 


86 


37 


1638 


gil67548 
77 


Cyprinus carpto 


Smad4 type4 


79 


27 


1639 


AAY849 
01 


Homo sapiens 


INCY- A human proliferation » 
and apoptosis related protein. 


2821 


95 


1639 


gil20532 
25 


Homo sapiens 


mRNA; cDNA 

DKFZp434P2235 (from clone 
DKFZp434P2235); complete 
cds. 


2806 


95 


1639 


gi37330 


Homo saoiens 


H. sapiens mRNA for tre 
oncogene (clone 210). 


2053 


7ft 


1640 


AAY849 
01 


Homo saniens 


rMf""V~ A humfiti nrnlifhrfitinn 

and apoptosis related protein. 


2846 


yo 


1640 


gil20532 
25 


Homo saniens 


mRNA- cDNA 

DKFZp434P2235 (from clone 
DKFZn434P2235V complete 
cds. 


2834 




1640 


gi37330 


Homo sapiens 


H saniens mRNA for tre 
oncogene (clone 210). 


2050 


77 


1641 


ei 139362 
85 


Mus musculus 


TRH4 


1522 


77 


1641 


eil28455 
40 


Mus musculu*? 


niffftti vp 


1520 


77 


1641 


AAU007 
82 


Homo sapiens 


INCY- Human apoptosis 
protein, APOP-2. 


1345 


98 


1642 


gi!72253 
31 


Homo sapiens 


MY0876G05 protein (MY876) 
mRNA, complete cds. 


1209 


100 


1642 


gi 120020 
42 


Homo sapiens 


brain my048 protein mRNA, 
complete cds. 


1209 


100 


1642 


ci!76461 
46 


Homo ^aniens 


B Ivttinhocvte acttvation-rplatpd 

protein mRNA, complete cds. 


91 1 


7ft 


1643 


gU04373 
07 


Homo sapiens 


cDNA: FLJ21240 fis t clone 
COL01132. 


2090 


100 


1643 


AAB747 
30 


Homo sapiens 


INCY- Human membrane 
associated protein MEMAP-36. 


856 


42 


1643 


AAY949 
06 


Homo sapiens 


GEMY Human secreted protein 
clone rb649 3 protein sequence 
SEQ ID NO: 18. 


853 


42 


1644 


AAY144 
48 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 38 clone 
HFGAH44. 


316 


100 


1645 


eil03344 
43 




T-Tnmnn TYWA «f*niiPttf*f* frntn 

clone RP11-291L22 on 
chromosome 10 Contains the 3' 
end of the HSD17B7 
(hydroxysteroid (17- beta) 
dehydrogenase 7) gene, part of a 
gene similar to CDC 10 (ceil 
division cycle 10, S. cerevisiae, 
homolog), part of a novel gene, a 
novel pseudogene, STSs, GSSs 
and a CpG Island, complete 
sequence. 




1 00 


1645 


gi560623 


human, fetal 
lung, mRNA, 
2314ntl. fHomo 


hCDC10=CDCL0 homolog 


236 


72 



214 



WO 02/070539 



PCT/US02/05095 



SEQ 
ID 


Hit ID 


Spcicics 


Description 


S score 


Percent 
identity 






sapiens ^ 








1645 


gi286460 
6 


Mus musculus* 


CDC 10 


236 


72 


1646 


AAB189 
69 


Homo sapiens V 


INCY- Amino acid sequence of 
a human transmembrane protein. 


1707 


95 


1646 


AAB495 
04 


Homo sapiens ^ 


HUMA- Clone HNTMH27, 


1370 


94 


1646 


gil 63075 
93 


Mus musculus 


RIKEN cDNA 22 1 0021 G2 1 

gene 


1325 


90 


1647 


gtl00472 
31 


Homo sapiens 


mRNA for KIAA1578 protein, 
partial cds. 


2083 


95 


1647 


gi684119 
4 


Homo sapiens 


HSPC272 


281 


81 


1647 


gtl08003 
75 


Caenorhubditis 
elegans 


Hypothetical protein Y67D8C.5 


192 


21 


1648 


gU43493 
55 


Homo sapiens 


hypothetical protein FLJ23323, 
clone MGC: 14873 
IMAGE:3948222, mRNA, 
complete cds. 


1771 


100 


1648 


gil 04399 
69 


Homo sapiens 


cDNA: FLJ23323 fis f clone 
HEP12456. 


1771 


100 


1648 


gil28525 
02 


Mus musculus 


putative 


1540 


65 


1649 


AAY413 
60 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 53 clone 
HJPAD75. 


490 


100 


1649 


AAM244 
06 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1931. 


335 


100 


1649 


AAY414 
70 


Homo sapiens 


HUMA- Fragment of human 
secreted protein encoded by 
gene 53. 


84 


100 


1650 


AAY413 
60 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 53 clone 
HJPAD75, 


267 


63 


1650 


AAM244 
06 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO; 1931. 


199 


90 


1650 


AAY414 
70 


Homo sapiens 


HUMA- Fragment of human 
secreted protein encoded by 
gene 53. 


84 


100 


1651 


gi 165503 
12 


Homo sapiens 


cDNA FU30993 fis, clone 
HLUNG1000064, weakly 
similar to KARYOGAMY 
PROTEIN KAR4. 


2449 


100 


1651 


gil63068 
92 


Homo sapiens 


clone MGC:2902 
IMAGE:3010654, mRNA, 
complete cds. 


2449 


100 


1651 


gil39385 
95 


Homo sapiens 


Similar to CG7818gene 
product, clone MGC :453 1 
IMAGE:3010654, mRNA, 
complete cds. 


2449 


100 


1652 


AAG671 
58 


Homo sapiens 


MILL* Amino acid sequence of 
a human 20685 transporter 
polypeptide. 


1586 


100 


1652 


gil 32741 
22 


Homo sapiens 


Human DNA sequence from 
clone RP1-55C23 on 
chromosome 6q22.3-23.3 
Contains the VNNI and VNN2 


1586 


100 



215 



WO 02/070539 



PCT/US02/05095 



SEQ 
ID 


Hit ID 


Speicies 


Description 


S score 


Percent 
identity 


I 


) 




genes for vanin 1 and 2, the gene 
for vanin 3 (VNN3), a HLF 
(hepatic leukemia factor) 
pseudogene, a CCNGl (cyclin 
GI) pseudogene* the 3' part of a 
novel gene, ESTs, GSSs, and 
STSs, complete sequence. 






1652 


gil57958 
17 


Homo sapiens 


unnamed protein product 


1586 


100 


1653 


AAB541 
63 


Homo sapiens 


HUMA- Human pancreatic 
cancer antigen protein sequence 
SEQ 1DN0:615. 


322 


63 


1653 


gi 180886 


Homo sapiens 


Human colipase mRNA, 
complete cds. 


322 


63 


1653 


gil73897 
64 


Homo sapiens 


colipase, pancreatic, clone 
MGC:23801 IMAGE:4251084, 
mRNA. complete cds. 


322 


63 


1654 


AAU171 
98 


Homo sapiens 


HUMA- Novel signal 
transduction pathway protein, 
Seq ID 763. 


1359 


100 


1654 


gi 159874 
93 


Homo sapiens 


tumor endothelial marker 6 
(T£M6) mRNA, complete cds. 


1359 


100 


1654 


gi!43257 
70 


Homo sapiens 


mRNA for thyroid specific PTB 
domain protein, complete cds. 


1359 


100 


1655 


AAG01 1 
18 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5199. 


360 


IO0 


1655 


gi 1541 89 
66 


Xenopus laevis 


annexin 4 


66 


44 


1656 


gil04370 
31 


Homo sapiens 


cDNA: FU21034fis, clone 
CAE09073. 


1403 


100 


1656 


gil63071 
59 


Mus musculus 


Unknown (protein for 
IMAGE:3493084) 


1060 


50 


1656 


gil38795 
51 


Mus musculus 


Unknown (protein for 
IMAGE:3709003) 


1060 


50 


1657 


AAM800 
47 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3693, 


820 


91 


1657 


AAM790 
63 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1725. 


820 


89 


1657 


ABB1214 
4 


Homo sapiens 


HYSE- Human HSPP-29 protein 
homologue, SEQ ID NO:2514. 


820 


91 


1658 


gi581382 

3 


Homo sapiens 


SUI1 isologmRNA, complete 
cds. 


470 


86 


1658 


gi450281 


Homo sapiens 


suilisol mRNA, complete cds. 


470 


86 


1658 


gil42505 
20 


Homo sapiens 


putative translation initiation 
factor, clone MGC: 15684 
IMAGE:3350981,mRNA 3 
complete cds. 


470 


86 


1659 


gil68771 
87 


Homo sapiens 


clone MGC: 17299 
IMAGE:3845811 > mRNA ) 
complete cds. 


1094 


100 


1659 


AAyi29 

52 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted peptide. 


362 


98 


1659 


gi239437 
6 


Cercopithecus 
aethiops 


thromboxane A2 receptor; 
TBXA2R 


94 


29 


1660 


gi 1043 89 
46 


Homo sapiens 


cDNA: FLJ22527 fis, clone 
IIRC 12820. 


1017 


97 


1660 


gi 165493 


Homo sapiens 


cDNA FLJ30I49 fis ? clone 


665 


90 



216 



WO 02/070539 



PCT/IJS02/05095 



SEQ 
ID 


Hit ID 


Speicies 


DcsrrinHfvii 

1 I|I11U1I 


o score 


rertcnu 
identity 




12 




BRACE2000280, weakly similar 
to MNN4 PROTEIN. 






1660 


gi729920 
7 


Drosophila 
melanogaster 


CGI 6789 gene product 


519 


49 


1661 


AAB955 
72 


Homo sapiens 


HELI* Human protein sequence 
SEQ ID NO: J 8221. 


2231 




1661 


gi 104359 
27 


Homo sapiens 


cDNA FLJ 13798 fis, clone 
THYROI000124. 


2231 


I Oft 


1661 


gil28516 
20 


Mus musculus 


putative 


1745 


77 


1662 


AAY482 
56 


Homo sapiens 


MET A- Human prostate cancer- 

UililUUluLEU ViLV^ll 1 ^ . 


242 


75 


1662 


gi382085 
7 


Euglena 


maturase-like protein 


81 


30 


1662 


gill 5596 
49 


Leuconostoc 
m essnteroi des 


dextransucrase Dsrb742 


79 


39 


1663 


gi 140177 
83 


Homo sapiens 


mRNA for KIAA1783 protein. 

nnrtial pHc 

JJul Ual Luai 


2287 


100 


1663 


AAU171 
93 


Homo sapiens 


HUMA- Novel signal 

frfiiniiHiir't irvn natnitra\r nrnfain 
LI UliOUUlAIUU JJaLllWciy pi ULCIu, 

Scq ID 758. 


1725 


100 


1663 


AAY578 
95 


Homo sapiens 


INCY- Human transmembrane 
protein HTMPN-1 9. 


1634 


100 


1664 


ml 04369 
92 




cL/iN/\. rLtJz \ \)\) ( us, ci one 
CAE03871. 




100 


1664 


gi 137849 
43 


Mus musculus 


Unknown (protein for 

iVHJV-f . 1 1 tOl ) 


2930 


84 


1664 


gi 178628 
68 


Drosophila 

mel anoff a*iter 


R£01471p 


308 


28 


1665 


AAG933 
18 


Homo sapiens 


NISC- Human protein HPI0505. 


465 


100 


1665 


gil63068 
68 


Homo sapiens 


mitochondrial ribosomal protein 
S2 1, clone MGC:2680 

complete cds. 


465 


100 


1665 


gi 136209 

n 


Homo sapiens 


MRPS21 mRNA for 

tttitocliondrial i*ihft<mm»l nrn^m 

S21, complete cds. 


465 


100 


1666 


AAU276 

52 


Homo sapiens 


ZYMO Human protein 
AFP213641. 


1484 


100 


1666 


gi 158624 
70 


Homo sapiens 


unnamed protein product 


1484 


100 


1666 


AAE0607 
1 


Homo sapiens 


HUMA- Human gene 31 
encoded secreted protein 
HBJLF01, SEQ ID NO: 133. 


1284 


100 


1667 


AAW781 

32 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 7 clone 
HPEBDB5. 


246 


100 


1668 


AAM502 
13 


Homo sapiens 


CURA- Human interleukin-1 1- 
like AMF7 C-terminal 
polypeptide. 


2219 


99 


1668 


gil65489 
21 


Homo sapiens 


unnamed protein product 


2219 


99 


1668 


AAB948 
03 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15937. 


1358 


100 


1669 


gi 120533 
65 


Homo sapiens 


mRNA; cDNA 

DKFZp586O0222 (from clone 


2747 


100 



217 



WO 02/070539 



PCT/US02/05095 



SEQ 
ID 


Hit ID 


Speicies 


Description 




Percent 
Identity 








DKFZp586O0222); complete 
cds. 


r 




1669 


gi936853 
8 


Homo sapiens 


mRNA full length insert cDNA 
clone EUROIMAGE 1987170. 


2687 1 


98 


1669 


gi996590 
5 


Mus musculus 


synembryn 


2383 I 
D 


86 


1670 


gil04402 
80 


Homo sapiens 


cDNA: FLJ23554 fts, clone 
LNG09359 


3757 


100 


1670 


gi!28552 
47 


Mus museums 


putative 


1339 


64 


1670 


gi 146026 
09 


Homo sapiens 


hypothetical protein FLJ23554, 
clone Mnr'14fififi 

VIV/llV JVlVJVi*. l^OUU 

IMAGE:3946091, mRNA, 
complete cds. 


1236 


99 


1671 


gil04417 
32 


Homo sapiens 


leucine-rich repeat-containing G 
protein-coupled receptor 6 

\i~>\jt\\jj imvjNrt^ ^atiiul Vila. 


4286 


99 


1671 


gil73828 
94 


Homo sapiens 


unnamed protein product 


3899 


93 


1671 


gil73828 
82 


Mus musculus 


unnamed protein product 


3477 


84 


1672 


AAB94I 
18 


Homo sapiens 


HELI* Human protein sequence 
<?po in no- 1 A'XfD 


1936 


100 


1672 


gi 104341 
08 


Homo sapiens 


cDNAFLJ 12552 fis, clone 
NT2RM4000712, moderately 

ubiquitin hydrolyzing enzyme I 
CUBHH mRNA 


1936 


100 


1672 


AAB958 
06 


Homo sapiens 


HELI- Human protein sequence 
SEO ID NO' 1 8793 


1930 


99 


1673 


AAB482 
93 


Homo sapiens 


UYYA Human ZF5 protein. 


1407 


80 


1673 


gi645611 
4 


Mus musculus 


F-box protein FBX16 


1407 


80 


1673 


ABB1559 
0 


Homo sapiens 


HUMA- Human nervous system 
related polypeptide SEQ ID NO 
4247. 


831 


90 


1674 


gi 100471 
63 


[Homo sapiens 


mRNA for KIAA1549 protein, 
partial cds. 


7563 


100 


1674 


AAM791 
57 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1819. 


948 


28 


1674 


AAM801 
41 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3787. 


941 


30 


1675 


gi 167686 
54 


Drosophila 
metanogaster 


HL01494p 


911 


39 


1675 


gi729229 
9 


Drosophila 
melanogaster 


CG1271 gene product 


888 


38- 


1675 


gi498199 
5 


Therm otoga 
maritima 


glycerol kinase 


846 


38 


1676 


gil2852S 
37 


Mus musculus 


putative 


892 


69 


1676 


gi322823 
7 


Homo sapiens 


UHS KerB gene. 


871 


71 


1676 


gi200962 


Mus musculus 


serine 1 ultra high sulfur protein 


827 


65 


1677 


gil73901 
82 


Homo sapiens 


clone IMAGE:4797244 J 
mRNA, partial cds. 


1694 


99 


1677 


AAY761 


Homo sapiens 


HUMA- Human secreted protein 


863 


99 



218 



WO 02/070539 



PCT/US02/05095 



SEQ 
U> 


Hit ID 


Speicics 


Description 


S score 


Percent 
identity 




77 




encoded by gene 54, & 






1677 


AAY043 
06 


Homo sapiens 


HUMA- Human secrete^ 
encoded by gene 14, 


protein 


328 


93 


1678 


gi568941 
7 


Homo sapiens 


mRNA for K1AA1040 pi 
partial cds. 


jtein, 


2793 


99 


1678 


gi 107286 
60 


Drosophila 
melanogaster 


CG8683 gene product 


2736 


48 


1678 


AAY023 
67 


Homo sapiens 


ONOY Polvnentide identifipd hv 

the signal sequence trap method, 


2663 


99 


1679 


gi 104399 
64 


Homo sapiens 


cDNA* FLJ23320 fk clone 
HEP12381. 


3605 


99 


1679 


AAG744 
99 


Homo sapiens 


HI /MA- Miinrififi rftlnri rntt^pr 

antigen protein SEQ ID 
NO* 526^ 


623 


95 


1679 


gil28306 
79 


Drosophila 
helvetica 


putative transposase 


220 


24 


1680 


gi!04382 
77 


Homo sapiens 


cDNA: FLJ22028 rls, clone 
HEP08589. 


2454 


100 


1680 


AAB736 
81 


Homo sapiens 


INC Y- Human oxidoreductase 
protein ORP-14. 


2337 


100 


1680 


gi729865 
9 


Drosophila 
melanogaster 


CG10721 gene product 


882 


43 


1681 


AAB943 
93 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 1 4957. 


3231 


99 


1681 


gi 104347 
65 


Homo sapiens 


cUNA vUlly/S lis, clone 
NT2RP2006023. 


3231 


99 


1681 


gil 28604 
50 


Mus museums 


putative 


1177 


64 


1682 


gi 104377 
48 


Homo sapiens 


cDNA:FLJ216l5fis, clone 
COL07393. 


876 


100 


1682 


gil3195l 
51 


Homo sapiens 


transcription factor TZP (TZP) 
mRNA, complete cds, 


362 


47 


1682 


gil024I4 
61 


Homo sapiens 


Human DNA sequence from 
clone RP5-1121G12on 
chromosome 20 Contains the 3' 

f*tlfl fif n frftn^ i^nnrtrlincr tlir* 
vllU yJl a gCIIC clUjUUlUg UK 

hepatocellular carcinoma- 

the SCANDt gene encoding the 
domain-containing 1 protein , a 
novel gene, 2 CpG islands, 
ESTs, STSs and GSSs, complete 
sequence. 


362 


47 


1683 


AAB439 
00 


Homo sapiens 


HUMA- Human cancer 
associated protein sequence SEQ 
ID NO; 1 345. 


483 


87 


1683 


gil 68491 
7 


Homo sapiens 


Bruton's tyrosine kinase (BTK), 
alpha-D-galactosidase A (GLA), 
L44-like ribosomai protein 
(L44L) and FTP3 (F1T3) genes, 
complete cds. * 


483 


87 


1683 


gil 28047 
05 


Homo sapiens 


ribosomai protein L44, clone 
MGC:2064 IMAGE:3353669, 
mRNA, complete cds. 


483 


87 


1684 


AAY079 
31 


Homo sapiens 


HUMA- Human secreted protein 
fragment encoded From gene 80. 


213 


100 


1684 


gi593200 


Mus musculus 


neuronal apoptosis inhibitory 


68 


50 



219 
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SEQ 
ID 


Hit 10 


Speicies 


Description 


S score 


Percent 
identity 




3 


- t — 


protein-rs6 






1684 


gi593200 
8 


Mus musculus 


neuronal apoptosis inhibitory 
protein 


68 


50 


1685 


gil40304 
07 


Mus mu.fiulus 


keratin-associated protein 16.4 


380 


77 


1685 


gi 140304 
09 


Mus musSulus 


keratin-associated protein 16.5 


309 


64 


1685 


gi 140304 
01 


Mus musculus 


keratin-associated protein 16.1 


302 


67 


1686 


gil53417 
94 


Homo sapiens 


hypothetical protein FLJ12787, 
clone MGC: 16870 
IMAGE:3048487, mRNA, 
complete cds. 


1433 


100 


1686 


AAB951 
82 


Homo sapiens 


HE LI- Human protein sequence 
SEQ ID NO: 17250. 


1426 


99 


1686 


gil04344 
81 


Homo sapiens 


cDNA FLJ 12787 fis, clone 
NT2RP200I943. 


1426 


99 


1687 


AAY384 
01 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 1 6. 


230 


88 


1687 


AAB256 
87 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 23 
SEQ IDNO:76. 


66 


38 


1688 


gi 127449 
21 


Homo sapiens 


tethering factor SEC34 (SEC34) 
mRNA, complete cds. 


4223 


100 


1688 


git 45496 
69 


Homo sapiens 


vesicle docking protein SEC34 
mRNA, complete cds. 


4212 


99 


168B 


gil52915 
37 


Drosophila 
melanogaster 


GH25768p 


1691 


43 


1689 


AAB907 
46 


Homo sapiens 


GEMY Human DF989„3 protein 
sequence SEQ ID 192. 


545 


97 


1689 


AAW644 
71 


Homo sapiens 


GEMY Human secreted protein 
from clone DF989 3. 


545 


97 


1689 


gi282930 
2 


Homo sapiens 


mRNA for Efsl, complete cds. 


74 


37 


1690 


AAW136 
58 


Homo sapiens 


UYMC- Human cytidine 
deaminase. 


657 


87 


1690 


gi598149 


Homo sapiens 


cytidine deaminase (CD A) 
mRNA, complete cds. 


657 


87 


1690 


gi432l79 
3 


Homo sapiens 


cytidine deaminase gene, exon 4 
and complete cds. 


657 


87 


1691 


gi 126980 
79 


Homo sapiens 


mRNA for KIAA1767 protein, 
partial cds. 


4013 


99 


1691 


AAM255 
78 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1093. 


3955 


99 


1691 


AAE0618 
6 


Homo sapiens 


HUMA- Human gene 58 
encoded secreted protein 
fragment, SEQ IDNO:248. 


3521 


99 


1692 


gi255901 
0 


Homo sapiens 


chapcronin containing t»complex 
polypeptide 1, eta subunit (Ccth) 
mRNA, complete cds, 


890 


100 


1692 


gil41983 
88 


Mus musculus* 


chaperon in subunit 7 (eta) 


879 


98 


1692 


g!468504 


Mus musculus 


CCTeta, eta subunit of the 
chaperonin containing TCP- 1 
(CCT) 


879 


98 


1693 


gi 165520 

36 


Homo sapiens 


cDNA FLJ32028 fis, clone 
NTONG1000257. 


918 


100 
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1693 


AAB747 
68 


Homo sapiens 


HUM A- Human secreted protein 
sequence encoded by gene 13 
SEQ IDNO:77. 


592 


99 


1693 


AAB747 
45 


Homo sapiens 


HUMA- Human secreted protein 
sequence encoded by gene 13 
SEQ 1DN0:54. 


592 


99 


1694 


gi!00471 
57 


Homo sapiens 


mRNA for KIAA1546 protein, 
partial cds. 


3652 


100 


1694 


gil26978 
97 


Homo sapiens 


mRNA for KIAA1676 protein, 
partial cds. 


613 


39 


1694 


gi729228 
3 


prosophila 
melanogaster 


CG2083 gene product 


534 


35 


1695 


AAG000 
78 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 4159. 


164 


80 


1695 


gi237020 
2 


Homo sapiens 


mRNA for procollagen alpha 
2(V). 


164 


80 


1695 


gil 79698 


Homo sapiens 


Human collagen type V alpha-2 
mRNA, 5' end. 


164 


80 


1696 


gil 65525 
96 


Homo sapiens 


cDNA FU32466 fis, clone 
SKNMC2000065. 


2609 


99 


1696 


gil40178 
27 


Homo sapiens 


mRNA for K1AA1805 protein, 
partial cds. 


2609 


99 


1696 


gil 52079 
87 


Macaca 
fascicularfs 


hypothetical protein 


2588 


99 


1697 


AAB928 
79 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 11475, 


639 


73 


1697 


AAM414 

35 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6366. 


639 


73 


1697 


AAM396 
49 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2794, 


639 


73 


1698 


AAM254 
87 


Homo sapiens 


HYSE- Human protein sequence 
SEQ_IDNO:1002. 


586 


100 


1698 


AAG036 
67 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7748. 


582 


99 


1698 


gil28500 
50 


Mus musculus 


putative 


557 


93 


1699 


AAE0477 
4 


Homo sapiens 


INCY- Human vesicle 
trafficking protein- 1 7 (VETRP- 
17) protein. 


748 


100 


1699 


AAB4I6 

37 


Homo sapiens 


CURA- Human ORFX 
ORF140I polypeptide sequence 
SEQ IDNO:2802. 


748 


100 


1699 


gi33!995 
3 


Homo sapiens 


mRNA for TOM1 protein. 


638 


82 


1700 


gil4l401 
00 


Homo sapiens 


OTT gene for one twenty two 
proteins (spliced and unspliced 
forms). 


4797 


99 


1700 


AAB951 
11 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17093. 


4779 


99 


1700 


gi!4l613 
69 


Homo sapiens 


putative RNA-binding motif 
protein 15 short form (RBM15) 
mRNA, complete cds, 
alternatively spliced. 


4779 


99 


1701 


AAB938 
79 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 13792. 


2228 


100 


170! 


AAG667 
10 


Homo sapiens 


BIOD- Human cell growth 
inhibition protein 48. 


2228 


100 
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Tlfeft(*i*!r»f ioti 

JLMJO\*Jl S[J' HUM 




identity 


1701 


gil 44956 
27 


Homo sapiens 


clone MGC: 15047 
IMAGE:3535485, mRNA, 
complete cds. 


2228 


i no 


1702 


AAE1178 
0 


Homo sapiens 


1NCY- Human kinase (PKIN)- 
14 protein. 


4186 


100 


1702 


gii40418 
17 


Homo sapiens 


gklp mRNA for kinase- tike 
protein splice variant 1, 
complete cds. 


4186 


100 


1702 


AAB656 
79 


Homo sapiens 


SUGE- Move I orotein kinase 
SEQ ID NO: 207. 


4158 


07 


1703 


AAU122 
53 


Homo sanierm 


GFTH Human PR05774 

X X I X 1. LI 111 (111 X XY.VJ J II* 

polypeptide sequence. 






1703 


AAY307 
34 


Homo sanienQ 


HITN/IA — Am inn stf*irl cp/itiptipo 

of a human secreted protein. 




yo 


1703 


Eil 18166 
9 


Saecharom vne 1 ? 
cerevisiae 




7^ 


OA 


1704 


ei 104397 
62 


Hornrt saniftn 1 ? 


cDMA*FT T?^1fi4fif: rlnnp 
LNG09764. 




i nn 


1704 


ffi 104403 
12 


HKtlUXJ Mlf/lVll£> 


LNG12640. 


I too 


yo 


1704 


eil 26980 
85 


Homo ^anl^nt: 

X X\JHl\f tJtiJJltsllp 


mRNA for KTAA 177ft nrntpin 

partial cds. 


1 £1 A 


yy 


1705 


AAM418 
06 


Homo sapiens 


HYSE- Human polypeptide SEQ 


1194 


77 


1705 


AAM400 
20 


Homo sapiens 


HYSE- Human polypeptide SEQ 

Iks IN w Jl.\jJ. 


1194 


77 


1705 


ABB1223 
1 


Homo sapiens 


HYSE- Human novel protein, 
_SEQIDNO:2601. 


1194 


77 


1706 


65 




RTK"PW rniSJA 7^1ftft^0A1ft 

gene 


JUv i 


yi 


1706 


ei 128469 
32 




x/uian vp 




Ql 


1706 


AAB430 
28 


Homo sar>ien*i 


niRA- Human ORFY 

ORF2792 polypeptide sequence 
SEQ IDNO:5584, 


97J./; 


yj 


1707 


gil60411 

36 


Macaca 
fascicul&ris 


hypothetical protein 


702 


92 


1707 


AAB652 
16 


Homo sapiens 


GETH Human PRO1004 
(UNQ488) protein sequence 
SEQ IDNO:227. 


569 


92 


1707 


AAY666 
93 


Homo sapiens 


GETH Membrane-bound protein 
PRO1004. 


569 


92 


1708 


AAB956 
36 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 1 8369, 


2429 


100 


1708 


gil 04363 
57 


Homo sapiens 


cDNA FLJ 14009 fis, clone 
Y79AA1002431, weakly similar 
to TRANSDUCIN-LIKE 
ENHANCER PROTEIN 2. 


2429 


100 


1708 


gi503043 
9 


Homo sapiens 


chromosome 19, cosmid 
R26610, complete sequence. 


1569 


80 


1709 


gil57051 
43 


Mus nwsculus 


suppressor of cytokine signalling 
4 


2071 


86 


1709 


AAM007 
59 


Homo sapiens 


HYSE" Human bone marrow 
protein, SEQ ID NO: 122. 


1732 


100 


1709 


AAM008 
72 


Homo sapiens 


HYSE- Human bone marrow 
protein, SEQ ID NO: 348. 


1215 


99 


1710 


gil04387 


Homo sapiens 


cDNA: FLJ22408 fis, clone 


1934 


100 
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85 




HRC08416. 






1710 


AAM399 
17 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3062. 


837 * 


45 


1710 


AAM417 
03 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 6634. 


836 | 


45 


1711 


gil04374 
28 


Homo sapiens 


cDNA: FU2i343fi3, clone 
COL02679. 


1836^ 


100 


1711 


gil20531 
45 


Homo sapiens 


mRNA; cDNA 

DKF2p434A0926 (from clone 
DKFZp434A0926); complete 
cds. 


1388 


99 


1711 


gi568953 
7 


Homo sapiens 


mRNA for KIAA1 100 protein, 
complete cds. 


1261 


68 


1712 


AAG032 
54 


Homo sapiens 


GEST Human secreted protein, 
SEQJD'NQ; 7335. 


438 


89 


1712 


gi730038 
3 


Drosophila 
melanogaster 


CG7671 gene product 


366 


27 


1712 


AAB945 
51 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO; 153 12. 


309 


too 


1713 


gi 144956 
58 


Homo sapiens 


hypothetical protein FIJI 2687, 
clone MGC: 15791 
IMAGE:3504468, mRNA, 
complete cds. 


2639 


99 


1713 


AAB942 
41 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14627. 


2631 


99 


1713 


gi 104343 
33 


Homo sapiens 


cDNA FU12687 fis, clone 
NT2RM4002532, weakly similar 
to PROTEIN HOMl. 


2631 


99 


1714 


AAC623 
Sljial 


Homo sapiens 


CELL- Nucleotide sequence of 
lysophosphatidic acid 
acyltransferase-beta. 


834 


100 


1714 


AAA392 
92_aal 


Homo sapiens 


CELL- Human lysophosphatidic 
acid acy) transferase beta 
encoding cDNA. 


834 


100 


1714 


AAB306 
23 


Homo sapiens 


CELL- Amino acid sequence of 
lysophosphatidic acid 
acyltransferase-beta. 


834 


100 


1715 


AAB958 
47 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 18896. 


675 


100 


1715 


gi 104367 
63 


Homo sapiens 


cDNA FLJ14326 fis, clone 
PLACE4000247. 


675 


100 


1715 


gi 165529 
00 


Homo sapiens 


cDNA FLJ327JI fis, clone 
TESTI2000707, weakly similar 
to DOUBLESEX PROTEIN, 
MALE-SPECIFIC. 


90 


35 


1716 


AAB530 
94 


Homo sapiens 


GETH Human angiogenesis- 
associated protein PR0826, SEQ 
ID NO: 1 58. 


278 


ioa 


1716 


AAB509 
16 


Homo sapiens 


GETH Human PR0826 protein. 


278 


100 


1716 


AAB652 
04 


Homo sapiens 


GETH Human PR0826 
(UNQ467) protein sequence 
SEQ ID NO:20I. 


278 * 


100 


1717 


AAB949 
15 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:16349. 


961 


100 


1717 


gil46026 
23 


Homo sapiens 


hypothetical protein FLJ11526, 
clone MGC: 15059 


961 


100 
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IMAGE:39$610, mRNA, 
complete cds* 






1717 


gi 104327 
97 


Homo sapiens 


cDNA FLJ1|d26 Us, clone 
HEMBA100|f>55, weakiy 
similar to Holio sapiens 
mSin3A associated polypeptide 
p30 mRNA. 


961 


100 


1718 


AAY124 
39 


Homo sapiens 


GEST Human 5* EST secreted 
protein SEQ IDNO:470. 


453 


04. 


1718 


ei 134771 
83 


Homo sar>ien*i 


Similar to hvnnthetiral nrotpin 
FLJ20859, clone MGC: 12940 
IMAGE:2822127, mRNA, 
complete cds. 


453 


Ql 
y*r 


1718 


gi 128308 
10 


Homo sapiens 


false p73 target protein gene^ 


453 


94 


1719 


gi 159289 
65 


Homo sapiens 


hypothetical protein FLJ1 1354, 
clone MGC;22961 

complete cds, 


3522 


100 


1719 


AARQ37 

OS 




HFI T— T-Tiimnn nmtpiti cpmipnrA 

X XLjL-tl ITUIllClIl UIUICIII jCULIdl^t? 

SEQIDNO:13299. 




oo 

yy 


1719 


95 


Haiti r* Qftnipno 


cDNA FT T1 l^^dfitj rlnnp 

HEMBA1000129, weakly 
similar tn HYPOTHFTIPAI 
HEL1CASE CBA4.08C IN 
CHROMOSOME 1. 




oo 


1720 


gi 147173 
96 


Hnmo sanien? 


exchanger NCKX3 (SLC24A3) 
inRNA, partial cds. 


3108 


07 

7 / 


1720 


gi 125974 
41 


Mus musculus 


K.+-d&fiSndftnt Na/f^a pxc^hannpr 


3027 


Zr*T 


1720 


gi!20003 
97 


Rattus 
norvegicus 


potassium-dependent sodium- 
calcium exchanger NCKX3 


3025 


94 


1721 


gi 150724 
54 


Mus musculus 


von Willebrand factor A- related 
protein 


1614 


72 


1721 


AAB425 
81 


Homo sapiens 


CUR A- Human ORFX 
ORF2345 polypeptide sequence 
SEQ 1DNO:4690. 


1358 


93 


1721 


AAB883 
40 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone 
PSEC0053. 


1288 


98 


1722 


AAE0382 
2 


Homo sapiens 


HUMA- Human gene 5 encoded 
secreted protein HETKL27, SEQ 
ID NO: 68. 


935 


100 


1722 


AAB825 
97 


Homo sapiens 


HUMA- Human transmembrane 
protein encoded by cDNA clone 
HNALE36. 


935 


100 


1722 


gi 157064 
37 


Homo sapiens 


clone MGC: 17366 
IMAGE:3 860009, mRNA, 
complete cds. 


935 


100 


1723 


AAG665 
03 


Homo sapiens 


BIOD- Human ATfi-dependent 
helicase31. 


1441 


100 


1723 


AAM257 
80 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1295, 


1441 


100 


1723 


gil36763 
56 


Homo sapiens 


clone MGC:2679 
IMAGERS 1 9663, mRNA, 
complete cds. 


1434 


99 
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1724 


AAG673 
94 


Homo sapiens 


Amino acid sequence of 
Jiuman protein kinase SGK269. 


2322 


100 


1724 


gtl04371 
8L 


Homo sapiens 


IcDNA: FLJ2U40 fts, clone 
|bAS07548. 


3730 


100 


1724 


AAG673 
93 


Homo sapiens 


&UGE- Amino acid sequence of 
rmman protein kinase SGK223, 


952 


46 


1725 


AAF2449 
8 aal 


Homo sapiens 


GEST Human PG-3 coding 
sequence. 


4362 


99 


3725 


AAB354 
01 


Homo sapiens 


GEST Human PG=3, 


4355 


99 


1725 


gil33968 
64 


Homo sapiens 


unnamed protein product 


4355 


99 


1726 


AAB427 
84 


Homo sapiens 


CURA- Human ORFX 
ORF2548 polypeptide sequence 
SEQ IDNO;5096. 


817 


99 


1726 


gil22248 
87 


Homo sapiens 


mRNA; cDNA DKFZp547H027 
(from clone DKFZp547H027); 
complete cds. 


817 


99 


1726 


gil 04384 
59 


Homo sapiens 


cDNA: FLJ22174 fis, clone 
HRC00767. 


817 


99 


1727 


gil67405 
66 


Homo sapiens 


Similar to hypothetical protein 
FLJ13087, clone MGC: 15009 
1MAGE:3536735, mRNA, 
complete cds. 


1854 


too 


1727 


AAB952 
97 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17525. 


807 


95 


1727 


gi 104349 
41 


Homo sapiens 


cDNA FLJ 13087 fis, clone 
NT2RP3002099. 


807 


95 


1728 


AAB940 
75 


Homo sapiens 


HELI- Human protein sequence 
SEQIDNO:14267. 


3506 


97 


1728 


AAM939 
95 


Homo sapiens 


HELL- Human stomach cancer 
expressed polypeptide SEQ ID 
NO 59. 


3506 


97 


1728 


gil40421 
45 


Homo sapiens 


cDNA FLJ14550 fis, clone 
NT2RM2001696. 


3506 


97 


1729 


AAG008 
97 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO; 4978. 


392 


93 


1729 


AAG008 
98 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO; 4979. 


130 


100 


1729 


AAB425 
97 


Homo sapiens 


CURA- Human ORFX 
ORF236I polypeptide sequence 
SEQIDNO:4722, 


130 


100 


1730 


ABB116I 
3 


Homo sapiens 


HYSE- Human sorting nexin 7 
homologue, SEQ ID NO: 1983. 


2341 


100 


1730 


AAG741 
74 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein SEQ ID 
NO:4938, 


2309 


99 


1730 


gi488424 
1 


Homo sapiens 


mRNA; cDNA DKFZp564F052 
(from clone DKFZp564F052); 
partial cds. 


2148 


99 


1731 


gil34456 
60 


Homo sapiens 


MPlfr(LIM2) mRNA, complete 
cdsj alternatively spliced. 


933 


100 


1731 


gill 1775 
46 


Homo sapiens 


LIM2 (LIM2) and natural killer 
group 7 (NKG7) genes, 
complete cds. 


933 


100 


1731 


gil 34456 
58 


Homo sapiens 


MP19ins (LIM2) mRNA, 
complete cds, alternatively 


880 


80 
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spliced. 






1732 


AAB405 
91 


Homo sapiens 


CURA- Human ORFX ORF355 
polypeptide sequence SEQ ID 
NO:710, 


1056 


too 


1732 


gil56174 
58 


Oryctolagus 
cuniculus 


Rabl 1 family interacting protein 


869 


48 


1732 


AAY294 
88 


Homo sapiens 


CORI- Human lung tumour 
protein LT86-7 predicted amino 
acid sequence, 


557 


87 


1733 


gil 04377 
50 


Homo sapiens 


cDNA: FLJ2I616 fis, clone 
COL07477. 


1680 


99 


j 733 


gil 43493 
60 


Homo saDiens 

*<*VillV wilt/ 


Sim ilar to hvnothfiticnl nrntfiin 

FLJ21616, clone MGC:14941 
MAGE: 3947903, mRNA, 
complete cds. 


1443 


99 


1733 


gil28054 
73 


Mus tnusculus 


Unknown (protein for 
IMAGE:3490304) 


1410 


97 


1734 


gil 04377 
50 


Homo sapiens 


cDNA: FLJ21616 fis, clone 
COL07477. 


1645 


92 


1734 


gil 43493 
60 


Homo sapiens 


Similar to hypothetical protein 
FLJ21616, clone MGC:14941 
IMAGE:3947903, raRNA, 
complete cds. 


1565 


99 


1734 


gil 28054 
73 


Mus musculus 


Unknown foroteln for 
IMAGE:3490304) 


1412 


96 


1735 


AAB950 
36 


Hnmn ^aniRn^ 

1 1-\J 111 VJ d£4L/l Vii^ 


IXLiUt .llUlIltfLf Ul yluJlf SCUUUIVu 

SEQ ID NO: 16791. 


8fii 


1flO 


1735 


gil 04334 
48 


Homo sapiens 


cDNA FLJ 12060 fis, clone 
HEMBB 1002 142. 


863 


100 


1735 


gil53419 
04 


Homo sapiens 


clone MGC:21051 
IMAGE: 44768 86, mRNA, 
complete cds. 


751 


99 


1736 


gil 40437 
83 


Homo sapiens 


clone MGC: 14256 
IMAGE:4129368 f mRNA, 
complete cds. 


2232 


100 


1736 


gil 04368 
57 


Homo sapiens 


cDNA: FLJ20897 fis, clone 
ADKA03573. 


2232 


100 


1736 


gil26537 
85 


Homo sapiens 


clone IMAGE:3349601 S 
mRNA, partial cds. 


1783 


99 


1737 


gil 42498 
50 


Homo sapiens 


clone MGC: 1 5062 
IMAGE:2959567, mRNA, 
complete cds. 


1535 


99 


1737 


AAM795 
39 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3185. 


1523 


53 


1737 


AAM785 
55 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1217. 


1523 


53 


1738 


gil31833 
38 


Homo sapiens 


calneuron 1 (CALN1) mRNA, 
complete cds. 


881 


100 


1738 


g(131833 
40 


Mus tnusculus 


calneuron 1 


880 


99 


1738 


gi767034 
4 


Mus musculus 


unnamed protein product 


880 


99 


1739 


gi499592 
7 


Xenopus laevis 


p33 ringo 


545 


49 


1739 


gil 02802 
85 


Xenopus sp. 


unnamed protein product 


545 


49 


1739 


gi446879 


Xenopus laevis 


speedy protein 


540 


48 



226 



WO 02/070539 



PCT/US02/05095 



SEQ 
ID 


Hit ID 


Spcicics 


Description 


S score 


Percent 
identity 




0 










1740 


gil52815 
53 


Homo sapiens 


solute carrier family 12 member 
8 (SLC12A8) gene, partial cds. 


1905 


80 


1740 


gil 72249 
40 


Mus musculus 


cation-chloride cotransporter 


1761 


74 


1740 


gil 04397 
94 


Homo sapiens 


cDNA: FLJ23188 fis, clone 
LNG12038. 


1613 


99 


1741 


gi633016 

3 


Homo sapiens 


mRNA for K1AA1 161 protein, 
partial cds. 


1137 


99 


1741 


gi730133 
3 


Drosophila 
melanogastcr 


CGI 1909 gene product 


485 


41 


1741 


gi64404 


Torpedo 
ealiforrnea 


4-acetamldo-4- 

disulphonic acid-binding protein 


415 


38 


1742 


AAM678 
57 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SFO ID NO- 28163 


1553 


100 


1742 


AAM554 
71 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded 


1553 


100 


1742 


gil 56208 
49 


Homo sapiens 


mRNA for K1AA1895 protein, 
partial cds. 


1553 


100 


1743 


0 


liumu sapiens 


nunc ajojo unitnown mrviN/v, 
partial cds. 






1743 


gil 33252 

O I 


Homo sapiens 


hypothetical protein MGC2683, 

IMAGE:28 19900, mRNA, 
complete cds 


700 


100 


1743 


gil 26544 
85 


Homo sapiens 


clone MGC:2683 
IMAGF'2819900 mRNA 

complete cds. 


700 


100 


1744 


fill 393 83 
07 


Homo sapiens 


clone MGC* 15626 
IMAGE;3343642, mRNA, 
complete cds. 


526 


62 


1744 


AAB907 
65 


Homo sapiens 


NOJI/ Human shear stress- 
response protein SEQ ID NO: 
30. 


524 


64 


1744 


AAB621 
59 


Homo sapiens 


NEUR- Human arginine-rich 
protein. 


524 


64 


1745 


gi724320 
7 


Homo sapiens 


mRNA for K1AA1413 protein, 
partial cds. 


7273 


99 


1745 


AAB930 
57 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 11 862, 


3014 


99 


1745 


gi702286 
1 


Homo sapiens 


cDNA FLJ10680 fis, clone 
NT2RP2006573, weakly similar 
to2',3'-CYCLIC NUCLEOTIDE 
^-PHOSPHODIESTERASE 
(EC 3.1.4.37). 


3014 


99 


1746 


gi724320 
7 


Homo sapiens 


mRNA for KIAA1413 protein, 
partial cds. 


7245 


98 


1746 


AAB930 
57 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 11862. 


3014 


99 


1746 


gi702286 
1 


Homo sapiens 


cDNA FU1068O fis, clone 
NT2RP2006573, weakly similar 
to 2',3'-C YCLIC NUCLEOTIDE 
3'-PHOSPHODIESTERASE 
(EC 3.1.4.37). 


3014 


99 
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Percent 
identity 


1747 


AAB1033 
0 


Homo sapiens 


INCY- Human transporter and 
ion channel-7 (TRICM-7) 
protein. 


152& 


100 


1747 


AAM394 
22 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2567. 


694 | 


48 


1747 


AAM793 
97 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3043. 





48 


1748 


gi 104397 
44 


Homo sapiens 


cDNA:ELJ23I51 fis, clone 
LNG094I7. 


2362 


100 


1748 


gi 128604 
56 


Mus musculus 


putative 


1732 


70 


1748 


gi895406 
3 


Arabidopsis 
thaliana 


Contains similarity to a 
transposable element TiplOO 
urotein for transDOsase from 
Ipomoea purpurea gb|4063769 
and is a member of the 
transmembrane 4 family 
PF|00335. 


308 


24 


1749 


gil75299 
87 


Homo sapiens 


oxvsterol-bindiiiP" nrotpin-Iikp 
protein OSBPL3 (OSBPL3) 
mRNA, complete cds. 


4.A7 1 

*TVJ / I 


i on 


1749 


gil73893 
82 


Homo sapiens 


oxysterol binding protein-like 3, 
clone MGC:21526 
IMAGE:3909164, mRNA, 
complete cds* 


4671 


100 


1749 


gil08809 
73 


Homo sapiens 


oxysterol binding protein-related 
protein 3 (ORP3) mRNA, 
complete cds. 


4671 


100 


1750 


AAB948 
79 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 16094. 


2813 


100 


1750 


gil04363 
38 


Homo sapiens 


cDNA FLJ13998 fis, clone 
Y79AA 1002229, weakly similar 
to DNA CROSS-LINK REPAIR 
PROTEIN PS02/SNM 1 . 


2813 


100 


1750 


AAB943 
19 


Homo sapiens 


HELI" Human protein sequence 
SEQ IDNO:14797. 


2330 


99 


1751 


AAB193 
90 


Homo sapiens 


LEXI- Amino acid sequence of a 
human lipoxygenase protein. 


3836 


100 


1751 


gil33781 
70 


Homo sapiens 


partial ALOXE3 gene for 
arachidonate lipoxygenase 3, 
exons 1 to 4B (and joined CDS). 


3836 


100 


1751 


gil04410 
04 


Homo sapiens 


epidermal lipoxygenase 
(ALOXE3) mRNA, complete 
cds. 


3830 


99 


1752 


AAM932 
41 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 2671, 


664 


95 


1752 


gi 150302 
70 


Homo sapiens 


clone MGC:9889 
IMAGE:3868330> mRNA, 
complete cds. 


664 


95 


1752 


AAO102 
85 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 24177. 


486 


78 


1753 


gi456191 


Homo sapiens 


Rsapiens mRNA for rho GDP- 
dissociation Inhibitor 1. 


818 


99 


1753 


gi337395 


Homo sapiens 


Human GDP dissociation 
inhibitor mRNA, complete cds. 


818 


99 


1753 


gi285979 


Homo sapiens 


Human rho GDI mRNA, 
complete cds. 


SIB 


99 
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S score 
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identity 


1754 


AAQ435 
49 jut I 


Homo sapiens 


HARD Gamma subunitof 
human retinal cGMP | 
phosphodiesterase DN A 


472 


100 


1754 


AAR931 
17 


Homo sapiens 


HARD cGMP- | 
phosphodiesterase gamma- 
subunit. 


472 


100 


1754 


AAR384 
84 


Homo sapiens 


HARD Gamma subunit of 
human retinal cGMP 
phosphodiesterase. 


472 


100 


1755 


AAY194 
46 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


590 


98 


1755 


AAYI95 
99 


Homo sapiens 


HUMA- SEQ ID NO 3 1 7 from 
W09922243. 


590 


98 


1755 


AAY196 
02 


Homo sapiens 


HUMA- SEQ ID NO 320 from 
W09922243, 


137 


100 


1756 


gi 104373 
93 


Homo sapiens 


cDNA: FLJ21313 fls, clone 
COL02176. 


2197 


99 


1756 


gi 142503 
21 


Homo sapiens 


hypothetical protein FU21313, 
clone MGC: 16820 
IMAGE:4148772, mRNA, 
complete cds. 


2193 


99 


1756 


gi 128585 
26 


Mus musculus 


putative 


1936 


86 


1757 


gi741435 


Homo sapiens 


Hoxl.8 gene for homeobox 
protein. 


478 


100 


1757 


£(278967 
2 


Homo sapiens 


homeobox protein A 10 
(HOXA10) gene, complete cds. 


478 


100 


1757 


gi 155592 
35 


Homo sapiens 


clone MGC: 12859 
IMAGE:41070i3, mRNA, 
complete cds* 


478 


100 


1758 


gil 89772 


Homo sapiens 


Human prostaglandin D2 
synthase gene, exons 2 through 6 
and complete cds. 


872 


90 


1758 


gi 135435 
68 


Homo sapiens 


prostaglandin D2 synthase 
(21kD, brain), clone 
MGC:14559 IMAGE:4294999, 
mRNA, complete cds. 


872 


90 


1758 


gil 29638 
79 


Homo sapiens 


prostaglandin D synthase 
mRNA, complete cds. 


872 


90 


1759 


gi298250 
8 


Homo sapiens 


mRNA for TCR beta chain, 
specific for Mage 3/HLA-A2. 


1296 


92 


1759 


gi300292 
5 


Homo sapiens 


T cell receptor beta chain 
(TCRB V 1 3S 1 -TCRBJ2S 1) 
mRNA, complete cds. 


1286 


92 


1759 


gi36733 


Homo sapiens 


H.sapicns mRNA for T-cell 
antigen receptor beta-chain. 


1047 


75 


1760 


AAM257 
28 


Homo sapiens 


HYSE- Human protein sequence 
SEQ ID NO: 1243. 


667 


99 


1760 


gil 55290 
64 


Homo sapiens 


sorting nexin 14(SNXI4) 
mRNA, complete cds. 


667 


99 


1760 


gil 34772 
73 


Homo sapiens 


clone MGC: 1321 7 * 
IMAGE:3959086, mRNA, 
complete cds. 


667 


99 


1761 


AAB948 
43 


Homo sapiens 


HELI- Human protein sequence 
SEQIDNO:160Ifc 


1761 


100 


1761 


gil 04362 
67 


Homo sapiens 


cDNA FU13955 fis, clone 
Y79AA1001177. 


1761 


100 
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1761 


gi!63?92 


Mus musculus 


Similar to hypothetical protein 
FU 13955 


1681 


85 


1762 


AAEOS05 

8 1 


Homo sapiens 


HUMA- Human four disulfide 
core domain (FDCD)-containing 
protein. 


1200 


84 


1762 


gU283?3 
76 


Mus musculus 


putative 


918 


68 


1762 


gi!26554 
52 


Homo sapiens 


mRNA for keratin associated 
protein 4.7 (KRTAP4J gene). 


892 


68 


1763 


AAG674 
85 


Homo sapiens 


LEXI- Amino acid sequence of a 
human transporter protein, 

r . , , ..C _ ^ — 


2391 


99 


1763 


AAE1033 

3 


Homo sapiens 


INC Y- Human transporter and 

ionchannel-10(TRlCH-10) 

protein. 


2368 


96 


1763 


gi 1271 82 
01 


Homo sapiens 


Human DNA sequence from 
clone RP11-305P22 on 
chromosome 20 Contains ESTs, 
STSs, GSSs and 7 CpG islands. 
Contains three novel genes and a 
novel gene for a helix- loop-helix 
DNA binding protein, complete 
sequence. 


2243 


100 


1764 


gt 157789 
48 


Homo sapiens 


Similar to thiamine 
pyrophosphokinase, clone 
MGC:14885 IMAGE:36221 16, 
mRNA 5 complete cds. 


717 


100 


1764 


gi 126672 
03 


Homo sapiens 


thiamine pyrophosphokinase 
(TPK1) mRNA, complete cds. 


717 


100 


1764 


gi 122489 
15 


Homo sapiens 


hTPItl mRNA for thiamin 
pyrophosphokinase, complete 
cds. 


717 


100 


1765 


gil04388 
31 


Homo sapiens 


cDNA: FLJ22439 fis, clone 
HRC09236. 


2525 


99 


1765 


AAB422 
37 


Homo sapiens 


CURA- Human ORPX 
ORF200I polypeptide sequence 
SEQ IDNO:4002. 


2084 


99 


1765 


gi 135592 
84 


Homo sapiens 


Human DNA sequence from 
clone RP5-H75I6on 
chromosome 20. Contains the 3' 
end of the gene for Ras inhibitor 
JC265 (Ras association 
(RalGDS/AF-6) domain 
containing protein), the 5' end of 
the gene encoding N-terrninal 
acetyltransferase complex ardl 
sub-unit, ESTs, STSs, GSSs and 
two CpG islands, complete 
sequence. 


829 


36 


1766 


gi958842 
8 


Homo sapiens 


Human DNA sequence from 
clone RP5-1024N4on 
chromosome lp32.1~33. 
Contains the gene for a novel 
Sodium :solute symporter family 
member similar to SLC5A1 
(SGLT1), a pseudogene similar 
to part of butyrophilin family 
members, a novel gene, ESTs, 
STSs, GSSs and a putative CpG 


2858 


100 
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island, complete sequence. 






1766 


gi529056 


Homo sapiens 


Na t /glucose cotransporter 
(SGLT1) gene, exon 15 and 
complete cds. 


1955 


55 


1766 


gi364604 
3 


Homo sapiens 


Human DNA sequence from 
PAC 127L4 on chromosome 22. 
Contains last exon (15) of the 
SLC5A1 (SGLT1) gene for 
solute carrier family 5 
(sodium/glucose cotransporter) 
member 1 (High Affinity 
Sodium-Glucose Cotransporter). 
Contains ESTs and STSs, 
complete sequence. 


1955 


55 


1767 


AAB734 
85 


Homo sapiens 


MILL- Human aminopeptidase 
22196. 


3657 


99 


1767 


gil25836 
12 


Homo sapiens 


mRNA for neurolysin. 


3657 


99 


1767 


gi!39224 
67 


Homo sapiens 


unnamed protein product 


3657 


99 


1768 


gil0l221 
38 


Rattus 
norvegicus 


SynGAP-a 


6651 


99 


1768 


gi293544 
8 


Rattus 
norvegicus 


synaptic ras GTPase-activating 
protein pi 35 SynGAP 


6634 


99 


1768 


gi662458 
7 


Homo sapiens 


Human DNA sequence from 
clone RP4-570F3 on 
chromosome 6. Contains the 5' 
end of the gene for the ortholog 
of the rat synaptic ras GTPase- 
activating protein pi 35 
SynGAP, gene LOC51596 for 
divalent cation tolerant protein 
CUTA or brain 
acetylcholinesterase nutatlve 
membrane anchor* the PHF1 
gene for PHD finger protein 1 , 
the KNSL2 gene for kinesin-like 
protein 2, the gene for a novel 
protein similar to ribosomal 
protein LI 2 (RPL12) and the 
gene for a novel protein similar 
to lysophospholipase II 
(LYPLA2). Contains ESTs, 
STSs, GSSs and four CpG 
islands^ complete sequence. 


6194 


100 


1769 


AAB958 
63 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 1 893 U 


1040 


100_ 


1769 


gil73892 
83 


Homo sapiens 


hypothetical protein FL/14346, 
clone MGC-21027 
IMAGE:44I5420, mRNA, 
complete cds. 


1040 


100 


1769 


gil04367 
91 


Homo sapiens 


cDNAFLJ14346fis, clone 
THYRO1001320. 


1040 


100 


1770 


AAB945 
17 


Homo sapiens 


HELI- Human protein sequence 
SEQ IDNO:15m 


2593 


99 


1770 


gil04351 

22 


Homo sapiens 


cDNA FLJ13203 Fis, clone 
NT2RP3004504, highly similar 
to M.musculus mRNA for CPEB 


2593 


99 
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protein. 




V.-fc„ 


1770 


gil26592 
16 


Homo sapiens 


cytoplasmic poJyadenyJation 
element-binding protein short 
form (CPEB1) mRNA, complete 
cds. 


2583 


98 | 


1771 


AAB480 
59 


Homo sapiens 


INCY- Human extracellular 
signaling molecule (EXCS) (ID 
1493630CD1). 


432 


97 4 


1771 


AAY360 
90 


Homo sapiens 


GEST Extended human secreted 
protein sequence, SEQ ID NO. 
475. 


420 


95 


1771 


AAY117 
68 


Homo sapiens 


GEST Human 5' EST secreted 
protein SEQ ID No: 368. 


257 


95 


1772 


AAB930 
75 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 11902, 


2936 


99 


1772 


gi 140424 
15 


Homo sapiens 


uDNA FLJ14710 fis, clone 
NT2RP3000632, weakly similar 
to ZINC FINGER PROTEIN 84. 


2936 


99 


Mil 


gil65514 
29 


Homo sapiens 


cDNA FLJ31551 fis, clone 
NT2RI2001083, moderately 
similar to ZINC FINGER 
PROTEIN 84. 


1813 


63 


1773 


AAY768 
43 


Homo sapiens 


INCY- Human proton ATPase 
subunit (HP AS) protein 
sequence. 


356 


100 


1773 


AAY885 
90 


Homo sapiens 


SATO/ Human tumour specific 
antigen amino acid sequence. 


356 


100 


1773 


AAW645 
34 


Homo sapiens 


SAGA Human fibrosarcoma cell 
line HT-1080 clone HP00442 
protein. 


356 


100 


1774 


gi 124073 
85 


Homo sapiens 


tripartite motif protein TRIM5 
isoform gamma (TRIM5) 
mRNA, complete cds; 
alternatively spliced. 


1818 


99 


1774 


gi 124073 
87 


Homo sapiens 


tripartite motif protein TRIM5 
isoform delta (TRIM 5) mUNA, 
complete cds; alternatively 
spliced. 


1559 


99 


1774 


gi!24073 
83 


Homo sapiens 


tripartite motif protein TRIM5 
isoform beta (TRIMS) mRNA, 
complete cds; alternatively 
spliced. 


1557 


100 


1775 


gil42506 
01 


Homo sapiens 


hypothetical protein FLJ22056, 
clone MGC:3045 
IMAGE:3343082, mRNA, 
complete cds. 


2600 


99 


1775 


gi!04383 
15 


Homo sapiens 


cDNA: FLJ22056 fis, clone 
HEP09916. 


1747 


100 


1775 


gi729529 
3 


Drosophila 
melanogaster 


CG8633 gene product 


597 


31 


1776 


gi 104370 
59 


Homo sapiens 


cDNA: FLJ21054 fis, clone 
CAS00538. 


736 


100 


1776 


gill5580 
99 


Mus museums 


syncoilin 


653 


89 


1776 


gi 128354 
05 


Mus musculus 


putative 


552 


84 


1777 


gi 163076 


Homo sapiens 


hypothetical protein FLJ 12270, 


3045 


100 
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08 




cloneMGC:10176 * 

JlVlJAVJli, J?UOUUt, lHlvl>rtj ^ 

complete cds. _ k 






1777 


gil56209 


Homo sapiens 


mRNA for KJAA1923 protein! 


3042 


99 


1777 


AAB939 


Homo sapiens 


HELI- Human protein sequencS* 


2190 


100 


1778 


gi795981 
o 

V 


Homo sapiens 


PRO1430 


299 


100 


1778 


AAO078 

j| A 


Homo sapkns 


HYSE- Human polypeptide SEQ 


75 


54 


1778 


AAO088 

70 


Homo sapiens 


HYSE- Human polypeptide SEQ 


73 


43 


1779 


AAG629 

10 


Homo sapiens 


KLEE/ Amino acid sequence of 
a human xylosylytransferase 
(XT) isoform XT-1I. 


4625 


100 


1779 


gill 3222 
70 


Homo sapiens 


mRNA for xylosyltransferase 11 
(XT-1I gene). 


4625 


100 


1779 


gi 152096 
53 


Homo sapiens 


human XI -11 






1780 


gi992997 
3 


Macaca 
fascicularis 


hypothetical protein 


1778 


96 


1780 


AAG787 
40 


Homo sapiens 


BODE- Human transcriptional 
elongation factor IIS 24. 


1162 


99 


1780 


ABB 11 22 

0 


Homo sapiens 


HYSE- Human TFJISh 
nomologue, SEQ ID NO;J5y0. 


653 


100 


1781 


gtl52778 
46 


Homo sapiens 


Similar to hypothetical protein 
FLJ21522, clone MGC;16817 
IMAGE:3853503, mRNA, 
complete cds. 


3122 


99 


1781 


AAB643 
72 


Homo sapiens 


INC Y- Amino acid sequence of 
human intracellular signalling 
molecule INTRA4, 


3100 


99 


1781 


gi 1043 76 
38 


Homo sapiens 


ct)MA: rLJZlDll ris, clone 
COL05884, 






t TOO 


AAoiHO 
25 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO:15483. 


^) Ail ft 


inn 


1/0/ 


87 


Homo sapiens 


clmna rJLJ ojiso lis, cione 
PLACE1001 104, weakly similar 
to MYOSIN HEAVY CHAIN, 


ZOoo 


inn 


1782 


gil65518 

IT 
1 / 


Homo sapiens 


cDNAFLJ31903fis, clone 

TJT9RP7nrtd'>/ift ivpalrlv similar 

IN 1 iI\T / UU*+jiU\/j WCaMy e>U)IJltll 

to MYOSIN HEAVY CHAIN, 
NONMI JSC! E TYPE ft 


2429 


92 


1783 


gil05681 
12 


Homo sapiens 


ALR-Iike protein mRNA, 
complete cds. 


17050 


100 


1783 


gi563007 
7 


Homo sapiens 


PAC clone RP5-98107 from 
7q34-q36, complete sequence. 


9606 


100 


1783 


AAB422 
30 


Homo sapiens 


CURA- Human ORFX 
ORP1994 polypeptide sequence 
SEQ IDNO:3988, 


9583 
* 


99 


1784 


AAB688 
76 


Homo sapiens 


INCY- Human RECAP 
polypeptide, SEQ ID NO: 6. 


1895 


85 


1784 


gi!04381 
74 


Homo sapiens 


cDNA: FU21952 fis t clone 
HEP04970. 


1895 


85 


1784 


gi 169242 


Homo sapiens 


hypothetical protein FU21952, 


1862 


84 
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96 


-r 

I 


clone MGC:2790 
1MAGE;2960984, mRNA, 
complete cds* 






1785 


AAU160 
26 


Homo saafens 


HUM A- Human novel secreted 
protein, Seq ID 979, 


1433 


100 


1785 


gil40l78 
23 


Homo sapfens 


mRNA for KIAA1803 protein, 
partial cds. 


1433 


100 


1785 


AAG780 
55 


Homo sapiens 


GEAT Human zinc finger 
domain DNA binding protein S 
1-3. 


1168 


99 


1786 


AAG004 
05 


Homo sapiens 


GEST Human secreted protein, 
SEO ID NO 4486 


397 


98 


1786 


AAM906 
02 


Homo sapiens 


HUMA- Human 
immunfi/haematonoietic antigen 
SEQ ID NO: 18195* 


215 


56 


1786 


AAM41 1 
14 




HYSF- Human nolvnentide SPO 
ID NO 6045. 


84 




1787 


AAG020 
95 


Homo sapiens 


GEST Human secreted protein, 
SEO ID NO- 6176 


302 


100 


1787 


gi665l03 
7 


Mus musculus 
dom e*tticu«3 


similar to RNA binding protein 


222 


53 


1787 


giI28478 
83 


Mus musculus 


putative 


222 


53 


1788 


gt979845 

2 


Homo sapiens 


mRNA for putative capacitative 
calcium channel (trp7 £>ene). 


4470 


100 


1788 


gi5326B5 
4 


Mus musculus 


receptor-activated calcium 


4392 


98 


1788 


gi229590 
3 


Homo sapiens 


Human putative calcium influx 
channel (hlrr)3} mRNA 
complete cds. 


3529 


81 


1789 


AAG023 
37 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6418. 


377 


98 


1789 


AAM008 
75 


Homo sapiens 


HYSE- Human bone marrow 
protein, SEQ ID NO: 351. 


72 


31 


1789 


ei 166669 

2 


Mus musculus 


alpha^NAC, muscle-specific 
form gp220 


72 


34 


1790 


AAB733 
81 


Homo sapiens 


NANF- Human gas vesicle 
protein homologue hGvpT-b* 


2838 


99 


1790 


gil20055 
09 


Homo sapiens 


HT025 mRNA, complete cds. 


2838 


99 


1790 


gi!73914 
58 


Homo sapiens 


clone MGC-2462 
IMAGE:2964737, mRNA, 
complete cds. 


1699 


99 


1791 


AAY994 
38 


Homo sapiens 


GETH Human PR01555 
(UNQ763) amino acid sequence 
SEQ IDNO:33S. 


1300 


100 


1791 


AAB240 
37 


Homo sapiens 


GETH Human PR01555 protein 
sequence SEQ ID NO:49. 


1300 


100 


1791 


gi 126542 
33 


Homo sapiens 


Similar to hypothetical protein, 
clone 1-2, clone MGC;5442 
IMAGE:3449979, mRNA, 
complete cds. 


1300 


100 


1792 


AAB638 
60 


Homo sapiens 


LUDW- Human prostate cancer 
associated antigen protein 
sequence SEQ ID NO: 1222. 


297 


50 


1792 


AAM767 
71 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 


272 


51 
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r~ 




SEQ ID MO: 37077. 






1792 


AAM63tf 


Homo sapiens 


MOLE- Human brain expressed 
single exon probe encoded 
protein SEQ ID NO: 36056. 


272 


51 


1793 


AAB942| 
49 • 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14645. 


799 


99 


1793 


gil04343 
56 


Homo sapiens 


cDNA FLJ 12700 fis, clone 
NT2RP1 000721. 


799 


99 


1793 


ABB 1242 
2 


Homo sapiens 


HYSE- Human bone marrow 
expressed protein SEQ ID NO: 
261. 


528 


99 


1794 


AAB651 
89 


Homo sapiens 


GETH Human PRO1013 
(UNQ496) protein sequence 
SEQ IDNO:158. 


1858 


99 


1794 


AAB875 
36 


Homo sapiens 


GETH Human PRO1013. 


1858 


99 


1794 


AAY666 
66 


Homo sapiens 


GETH Membrane-bound protein 
PRO1013, 


1858 


99 


1795 


AAB651 
89 


Homo sapiens 


GETH Human PRO1013 
(UNQ496) protein sequence 
SEQ IDNOU58. 


1655 


93 


1795 


AAB875 
36 


Homo sapiens 


GETH Human PRO1013. 


1655 


93 


1795 


AAY666 
66 


Homo sapiens 


GETH Membrane-bound protein 
PRO1013. 


1655 


93 


1796 


gi777026 
3 


Homo sapiens 


PRO3077 


620 


100 


1796 


gi 15891 6 

32 


Agrobacteriurn 
tumefaciens 


AGR_L_3035p 


67 


31 


1797 


AAY362 
33 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 10. 


302 


96 


1797 


gi298307 


Rattus sp. 


beta 3-adrenergic receptor; beta 
3-AR 


83 


38 


1797 


R&41216 


Rattus sp. 


beta 3-adrenergic receptor 


83 


38 


1798 


ABB1203 

7 


Homo sapiens 


HYSE- Human ribosomal 
protein L31 homologue, SEQ ID 
NO:2407. 


341 


100 


1798 


AAG038 
94 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7975. 


341 


100 


1798 


AAB437 
07 


Homo sapiens 


HUMA* Human cancer 
associated protein sequence SEQ 
IDNO:1152. 


341 


100 


1799 


AAE1099 
5 


Homo sapiens 


INCY- Human lipid metabolism 
enzyme-4 (LME-4) protein. 


2242 


99 


1799 


AAB419 
89 


Homo sapiens 


CURA- Human ORFX 
ORP1753 polypeptide sequence 
SEQ ID NO:3506. 


2224 


98 

- 


1799 


AAB940 
07 


Homo sapiens 


HELI- Human protein sequence 
SEQ IDNO:14123. 


1212 


99 


1800 


AAY761 
94 ■ 


Homo sapiens 
• 


HUMA- Human secreted protein 
encoded by gene 71. 


296 


98 


1800 


AAY131 
96 


Homo sapiens 


GEST Human secreted protein 
encoded by 5* EST SEQ ID NO: 
210. 


291 


96 


1800 


AAY194 
71 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


148 


96 


1801 


ABB 1242 


Homo sapiens 


HYSE- Human bone marrow 


2841 


99 
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2 




expressed protein SEQ ID NO: 
261. 






1801 


gi363S95 
6 


Homo sapiens 


PAC clone RP4-751H13 from 
7q35-qter, complete sequence. 


2830 


100 


1801 


gi767049 
6 


Mus musculus 


unnamed protein product 


2352 


84 - 


1802 


AAU123 
82 


Homo sapiens 


GETH Human PR0792 
polypeptide sequence. 


1332 


86 


1802 


AAB244 
16 


Homo sapiens 


GETH Human PR0792 protein 
sequence SEQ ID NO: 155. 


1332 


86 


1802 


AAB240 
55 


Homo sapiens 


GETH Human PR0792 protein 
sequence SEQ ID NO:3 1 . 


1332 


86 


1803 


AAU123 
82 


Homo sapiens 


GETH Human PR0792 
polypeptide sequence. 


1590 


100 


1803 


AAB244 
16 


Homo sapiens 


GETH Human PR0792 protein 
sequence SEQ ID NO: 1 55. 


1590 


100 


1803 


AAB240 
55 


Homo sapiens 


GETH Human PR0792 protein 
sequence SEQ ID NO:31. 


1590 


100 


1804 


AAB437 
13 


Homo sapiens 


HUM A- Human cancer 
associated protein sequence SEQ 
ID NO: 1 158. 


414 


98 


1804 


AAR114 
90 


Homo sapiens 


CALI- Tissue-plastin. 


414 


98 


1804 


gi339848 


Homo sapiens 


Human T-pIastin mRNA, 5* end. 


414 


98 


1805 


AAY194 
56 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


307 


90 


1805 


gi 13 8825 
00 


Mycobacterium 

tuberculosis 

CDC1551 


conserved hypothetical 
transmembrane protein 


70 


32 


U05 


gil 5507} 
4 


Mycobacterium 

tuberculosis 

H37R.V 


hypothetical protein Rv2673 


70 


32 


1806 


gi 165528 
50 


Homo sapiens 


cDNA FLJ32676 fis, clone 
TESTI 10001 68, weakly similar 
to PROTEIN PHOSPHATASES 
PP1 REGULATORY SUBUNIT 
SDS22. 


1265 


100 


1806 


gil28599 
36 


Mus musculus 


putative 


962 


78 


1806 


AAM388 
77 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2022, 


200 


45 


1807 


AAY413 
90 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene 83 clone 
HRAAB15. 


813 


100 


1807 


AAM399 
90 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 3135. 


581 


44 


1807 


AAM389 
99 


Homo sapiens 


HYSE- Human polypeptide SEQ 
ID NO 2 1 44. 


581 


44 


1808 


gil28361 
97 


Mus musculus 


putative 


2154 


75 


1808 


AAM823 
98 


Homo sapiens 


HUMA- Human 
immune/haematopoietic antigen 
SEQIDN0:9991. 


750 


90 


1808 


AAG040 
69 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 8150. 


493 


100 


1809 


AAB952 
52 


Homo sapiens 


HELI- Human protein sequence 
SEQIDNO:17419. 


3112 


99 
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1809 


gi 1 04347 
29 


Homo sapiens 


cDNA FLJ 12949 fis, clone 
NT2RP2005336, weakly similar 
to TRICHOHYAUN, 


3112 » 


99 


1809 


gi 128040 
75 


Homo sapiens 


hypothetical protein FLJI2949, 
clone MGC:11261 
IMAGE:3j42403, mRNA, 
complete cds. 


1786 | 


100 


1810 


AAY195 
87 


Homo sapiens 


HUM A- Amino acid sequence 
of a human secreted protein. 


160 


96 


1311 


gil56208 
81 


Homo sapiens 


mRNA for KLAA1911 protein, 
partial cds. 


1153 


100 


1811 


AAM916 
41 


Homo sapiens 


HUMA- Human 
immuneyhaematopoietic antigen 
SEQ ID NO: 19234. 


326 


50 


l&ll 


gil28458 
02 


Mus musculus 


putative 


309 


58 


1812 


gil04399 
U 


Homo sapiens 


cDNA: FLJ23282 fis, clone 
HEP07626. 


2677 


100 


1812 


AAY996 

53 


Homo sapiens 


INCY- Human GITase 
associated protein-4. 


2324 


100 


1812 


gil01671 

2 


Rattus 
norvegicus 


Fos-related antigen 


2060 


84 


1813 


AAY601 

52 


Homo sapiens 


META- Human endometrium 
tumour EST encoded protein 
212, 


379 


too 


1813 


AAY601 
51 


Homo sapiens 


META- Human endometrium 
tumour EST encoded protein 
211, 


83 


73 


1813 


AAB929 
84 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 11 704. 


71 


32 


1814 


gi 146029 
95 


Homo sapiens 


hypothetical protein FLJ23375, 
clone MGC:16634 
IMAGE:4121449, mRNA, 
complete cds* 


2299 


100 


1814 


gil04400 
40 


Homo sapiens 


cDNA: FLJ23375 fis, clone 
HEP16206. 


2294 


99 


1814 


AAB429 
73 


Homo sapiens 


CURA- Human ORFX 
ORP2737 polypeptide sequence 
SEQ ID NO:5474. 


725 


98 


1815 


AAM787 
22 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 1384. 


1808 


99 


1815 


gi!50539 
87 


Homo sapiens 


c-Mpl binding protein mRNA, 
complete cds. 


1439 


100 


1815 


AAE1019 
9 


Homo sapiens 


HYSE- Human bone marrow 
derived contig polypeptide, SEQ 
ID NO: 64. 


1231 


85 


1816 


gi 104404 
74 


Homo sapiens 


mRNA for FLJ00074 protein, 
partial cds. 


1001 


100 


1816 


gi702096 
9 


Homo sapiens 


cDNA FLJ20703 fis, clone 
KAIA1965. 


546 


63 


1816 


giI33251 
42 


Homo sapiens 


DKF2P586I2223 protein, clone 
MGC:10840 IMAGE:3616057, 
mRNA, complete cds. 


535 


«58 


1817 


gi 100472 
49 


Homo sapiens 


mRNA for KIAA1587 protein, 
partial cds. 


5002 


100 


1817 


gi 136764 

86 


Macaca 
fascicularis 


hypothetical protein 


4641 


92 
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1817 


gi 126591 
40 


Mus musculus 


mage-el ^ 


2474 


67 


1818 


AAY725 
96 


Homo sapiens 


ZYMO Human cytokine aljjha 
protein-27 (Zalpha27). 1/ 


3538 


99 


1818 


gil27102 
97 


Homo sapiens 


unnamed protein product • 


3538 


99 


1818 


gi 104375 
88 


Homo sapiens 


cDNA: FLJ21478fis, clone 
COL0501Z 


2793 


100 


1819 


gi 128387 
32 


Mus musculus 


putative 


1060 


91 


1819 


gi657221 
5 


Homo sapiens 


Human DNA sequence from 
cloneRPK37E16on 
chromosome 22 Contains the 3* 
part of the gene for a novel VHS 
domain containing protein 
similar to predicted worm and 
numan proteins, tne ctHiBFl 
gene for SH3-domain binding 
protein 1, the gene for a novel 
protein similar to 
nuropnenyipnospn&tases trom 
various organisms, the LOALS1 
gene for soluble galactoside- 
binding lectin 1, a novel gene 

finH tru* CrfTIP fnr n nni/pl ni*/\tA!n 
auu uiti ^[^ii^ iui ti uuvci pi u lulu 

similar to mouse RIP3 (P116 

Rho-ihteractinp nrotpin^ and rat 
RBI 09, complete sequence. 


647 


46 


1819 


gi 126531 
07 


Homo sapiens 


hvnothetical nmtein dTC7Fl f> S 
clone MGC:8472 
IMAGE;2821743, mRNA, 
complete cds. 


fid 7 




1820 


AAB736 
90 


Homo sapiens 


INCY- Human oxidoreductase 
protein ORP-23. 


2502 


100 


1820 


gi 104382 
22 


Homo sapiens 


cDNA: FU21988 fis, clone 
HEP06320. 


2502 


100 


1820 


gi 143367 
39 


Homo sapiens 


I6pl3.3 sequence section 3 of 8. 


2431 


90 


1821 


gi372410 
5 


Homo sapiens 


hHa4 gene for keratin type I . 


2029 


99 


1821 


gi372410 
1 


Homo sapiens 


hHa3-I gene for keratin type L 


1798 


90 


1821 


gi 128526 
06 


Mus musculus 


putative 


1796 


87 


1822 


gi322823 
7 


Homo sapiens 


UHS KerB gene. 


1079 


88 


1822 


AAM794 
04 


Homo sapiens 


HYSE- Human protein SEQ ID 
NO 3050. 


877 


81 


1822 


gi200962 


Mus musculus 


serine 1 ultra high sulfur protein 


867 


70 


1823 


AAB863 
64 


Homo sapiens 


MEMO- Human ceramidase K2 
protein. j 


1459 


100 


1823 


AAB189 
86 


Homo sapiens 


INCY- Amino acid sequence of 
a human transmembrane protein. 


1459 


100 


1823 


gi!46694 
36 


Homo sapiens 


alkaline phytoceramidasc 
(APHC) mRNA, complete cds. 


1459 


100 


1824 


AAW616 
01 


Homo sapiens 


INCY- Human metal lothionein 
HMBP-1. 


381 


98 
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1824 


AAB571 
83 


H<!mo sapiens 


ROSE/ Human prostate cancer 
antigen protein sequence SEQ 
ID NO:176L 


363 


84 


1824 


AA0138 
69 


rfj^o sapiens 


HYSE- Human polypeptide SEQ 
ID NO 27761. 


362 


81 


1825 


gil 59288 
96 


Horflo sapiens 


Similar to synaptotagm in-like 4, 
clone MGC:17313 
IMAGE;39O8307, mRNA, 
contnlete cds 


3496 


100 


1825 


gil73859 
44 


Rattus 
norvegicus 


granuphtlin A 


3225 


91 


1825 


6 


lWu<; mii^iriihiQ 

1V1UO HIUDVyUlUiS 


crrHFinnni 1 irwa 
^laiiujjiuiiii u 


J 1 o / 




1826 


gil26531 
47 


Homo sapiens 


signal sequence receptor, beta 

i trait <i In pnn.ne cr\pi a tJ^H nrrvtpln 

beta), clone MGC:8566 
complete cds. 


864 


99 


1826 


gi452757 


Homo sapiens 


H.sapiens mRNA for TRAP beta 
subuntt. 


847 


99 


1896 

1 QAU 


0 


numv oapidid 


signal sequence receptor, 
complete cds. 




OQ 


1827 


AAB427 
22 


Homo sapiens 


CURA- Human ORFX 
ORF2486 polypeptide sequence 
SEQIDNO:4972. 


2865 


100 


1827 


AAB952 
63 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17448. 


2864 


99 




59 


Homo sapiens 


cuina rJUJ t^yov us, clone 
NT2RP2005841, weakly similar 
to Homo sapiens mRNA for 

Af PYl 
r\LjiZi/\j . 


£00 4 


99 


1828 


gi798129 
7 


Homo sapiens 


Human DNA sequence from 

chromosome lp31. 2-32.3. 
Contains the PGM I gene for 
phosphoglucomutase 1, a novel 
gene, ESTs, STSs, GSSs and a 
putative CpG island, complete 
sequence. 


3323 


100 


1828 


gil60418 
46 


Homo sapiens 


clone MGC:9635 
IMAGE:3915942j mRNA, 
complete cds. 


3308 


99 


1828 


gil40178 
15 


Homo sapiens 


mRNA for KIAA1799 protein, 
partial cds* 


3186 


100 


1829 


gi943803 
3 


Homo sapiens 


ser/arg-rich pre-mRNA splicing 
factor SR-AI (SR-A1) gene, 
complete cds. 


6833 


100 


1829 


gil04404 
02 


Homo sapiens 


mRNA for FU00034 protein, 
partial cds. 


6827 


99 


1829 


gil43853 
2 


Rattus » 
norvegicus 


rAl 


5019 


82 


1830 


AAG812 
94 


Homo sapiens 


ZYMO Human AFP protein 
sequence SEQ ID NO: 106. 


1128 


100 


1830 


AAU035 
89 


Homo sapiens 


INCY- Human DNA 
modification protein, DNAMP- 
4. 


1128 


100 



239 



WO 02/070539 



PCT/!]S02/050?>5 



SEQ 
TD 


Hit ID 


Speicies 


Description 


S score 


Percent 
identity 


1830 


gi 173902 
02 


Homo sapiens 


Similar to RIKEN cDNA 
2510005D08 gene, clone 
MGC:27120 IMAGE:479312I, 
mRNA, complete cds. 


1128 


100 


1831 


AAY652 
82 


Homo sapiens 


GEST Human 5' EST related 
polypeptide SEQ ID NO:1443. 


472 


95 


1831 


gil28386 
27 


Mus musculus 


putative 


444 


68 


1831 


gi651386 
7 


Strongylocentrot 
us purpuratus 


tektin Al 


292 


40 


1832 


AAY108 
37 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


434 


100 


1832 


gi727160 
6 


Fowlpox virus 


ORF FPV108 Virion envelope 
protein 


72 


28 


1832 


£i333523 


Pigeonpox virus 


major envelope antigen 


72 


28 


1833 


AAR561 
66 


Homo sapiens 


USSH Neuroendocrine tumor 
dlk. 


2003 


95 


1833 


gil534l9 
94 


Homo sapiens 


clone MGC:17291 
IMAGE:4347187, mRNA, 
complete eds. 


2003 


95 


1833 


gil 55593 
10 


Homo sapiens 


clone MGC:20310 
IMAGE:4130556, mRNA, 
complete cds, 


2000 


94 


1834 


AAM514 
65 


Homo sapiens 


TAKE Human G protein- 
coupled receptor protein TGR5. 


1712 


100 


1834 


AAY865 
40 


Homo sapiens 


HUMA- Human gene 77- 
encoded protein fragment, SEQ 
IDNO:457. 


679 


98 


1834 


AAY862 
91 


Homo sapiens 


HUMA- Human secreted protein 
HDPRK33, SEQ ID NO:206, 


514 


98 


1835 


gil65525 
98 


Homo sapiens 


cDNA FU32467 fis, clone 
SKNMC2000097, moderately 
similar to M. musculus mRNA 
for protein Htf9C 


2883 


99 


1835 


gil 54264 
94 


Homo sapiens 


Hpall tiny fragments locus 9C, 
clone MGC:14943 
1MAGE:4054100> mRNA, 
complete cds. 


2517 


100 


1835 


gi 104375 
55 


Homo sapiens 


cDNA: FLJ21453 fis, clone 
COL04585. 


2517 


100 


1836 


gi724306 
1 


Homo sapiens 


mRNA for KIAA1340 protein, 
partial cds. 


2328 


99 


1836 


gi388214 
3 


Homo sapiens 


mRNA for KIAA071 1 protein, 
complete cds. 


311 


28 


1836 


gil 46029 
92 


Homo sapiens 


clone MGC: 16635 
IMAGE:4121528> mRNA, 
complete cds. 


309 


29 

- 


1837 


AAB947 
27 


Homo sapiens 


HEL1- Human protein sequence 
SEQ ID NO: 15753. 


2624 


96 


1837 


gi 104358 
06 


Homo sapiens 


cDNAFLJ13710fis, clone 
PLACE2000373, weakly similar 
to F-SPONDIN PRECURSOR. 


2624 


96 


1837 


gil42507 
42 


Homo sapiens 


hypothetical protein FLJ13710, 
clone MGC:14171 
IMAGE:4120678, mRNA, 
complete cds. 


2473 


98 


1838 


AAB199 


Homo sapiens 


INCY- Human oxidoreductase 


1900 


99 
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31 




OXRD-6. 






1838 


gi 170443 
56 


Homo sapiens 


unnamed protein product 


1900 


99 


1838 


AAW677 
37 


Homo sapiens 


REGC Human fsh05 gene 
protein product. 


1528 


100 


1839 


gil04400 
14 


Homo sapiens 


cDNA: FLJ23356 fis, clone 
HEP14919. 


1859 


99 


1839 


gil52082 
23 


Macaca 
fascicularis 


hypothetical protein 


1798 


96 


1839 


gil28536 
89 


Mus musculus 


putative 


1535 


80 


1840 


gi 142505 
12 


Homo sapiens 


clone MGC: 1 5468 
IMAGE:2966921,mRNA, 
complete cds. 


966 


88 


1840 


gi 140432 
62 


Homo sapiens 


Similar to RlKENcDNA 
1500026B10gene, clone 
MGC:15737 IMAGE:3355622, 
mRNAj complete cds. 


966 


88 


1840 


gil28377 
54 


Mus musculus 


putative 


710 


68 


1841 


gi767162 
9 


Homo sapiens 


Human DNA sequence from 
clone RP1M45L22 on 
chromosome 6p21. 32^22.2. 
Contains the gene for 
myelin/oligodendrocyte 
glycoprotein (MOG), the gene 
for a novel KRAB box 
containing C2H2 type zinc 
finger protein, ESTs, STSs> 
GSSs and a CpG island, 
complete sequence. 


2247 


90 


1841 


gi43U82 


Mus musculus 


Zfp-57 


674 


39 


1841 


gI144956 
50 


Homo sapiens 


zinc finger protein 331; zinc 
finger protein 463, clone 
MGC:15739 IMAGE :33 5 5780, 
mRNA, complete cds. 


444 


29 


1842 


gi 104399 
51 


Homo sapiens 


cDNA; FLJ2331I fis, clone 
HEP11681. 


2175 


99 


1842 


ABB1246 
1 


Homo sapiens 


HYSE- Human bone marrow 
expressed protein SEQ ID NO: 
300. 


259 


too 


1842 


gi554172 
2 


Arabidopsis 
thaliana 


putative protein 


218 


44 


1843 

-* 


gi999288 
4 


Homo sapiens 


vacuolar proton pump 1 16 kDa 
accessory subunit (ATP6N1B) 
mRNA^ complete cds, 
alternatively spliced. 


4378 


100 


1843 


AAB601 
00 


Homo sapiens 


INCY- Human transport protein 
TPPT-20. 


4331 


99 


1843 


gil69032 
13 


Mus musculus 


H-ATPase accessory subunit a4 


3809 


85 


1844 


AAM938 
59 


Homo sapiens 


HELI- Human polypeptide, SEQ 
ID NO: 3953. 


3149 


100 


1844 


gi 172249 
57 


Homo sapiens 


cation ic amino acid transporter 
(SLC7A3) mRNA, complete 
cds. 


3149 


100 


1844 


AAM933 


Homo sapiens 


HELI- Human polypeptide, SEQ 


3142 


99 
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85 




ID NO: 2970. 


— t 




1845 


gil20530 
07 


Homo sapiens 


mRNA; cDNA 

DKFZp434D1812 (from clone 
DKFZp434D1812); complete 
cds. 


603| 

1 


99 


1845 


gi424019 
5 


Homo sapiens 


mRNA fbr KIAA0853 protein, 
partial cds. 


4974^ 


100 


1845 


AAB670 
47 


Homo sapiens 


INCY- Human immune response 
molecule (IMUN) protein SEQ 
ID NO:l. 


3777 


99 


1846 


AAB941 
08 


Homo sapiens 


HELr- Human protein sequence 
SEQIDNO:14340. 


2903 


100 


1846 


AAU045 
57 


Homo sapiens 


GETH Human Stra6 homologuc, 
PRO10282. 


2903 


100 


1846 


gi 135609 
66 


Homo sapiens 


STRA6 isoform 1 mRNA, 
complete cds, alternatively 
spliced. 


2903 


100 


1847 


AAB5&3 
63 


Homo sapiens 


ROSE/ Lung cancer associated 
polypeptide sequence SEQ ID 
701. 


309 


100 


1847 


AAY485 
07 


Homo sapiens 


META- Human breast tumour- 
associated protein 52. 


308 


98 


1847 


AAM239 
52 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1477. 


294 


98 


1848 


AAM937 
37 


Homo sapiens 


HELl- Human polypeptide, SEQ 
ID NO: 3705. 


929 


49 


1848 


gi3 17060 
9 


Rattus 
norvegicus 


monocarboxylatc transporter 
MCT3 


631 


35 


1848 


gi767044 
6 


Mm musculus 


unnamed protein product 


631 


47 


1849 


AAB953 
59 


Homo sapiens 


HELI- Human protein sequence 
SEQIDNO:17661. 


3130 


99 


1849 


gil04350 
88 


Homo sapiens 


cDNA FLT13181 fis, clone 
NT2RP3004016, weakly similar 
to TRANSCRIPTION 
INTERMEDIARY FACTOR 1- 
BETA* 


3130 


99 


1849 


gi729735 
7 


Drosophila 
meJanogaster 


CG8419 gene product 


746 


29 


1850 


gi 104368 
13 


Homo sapiens 


cDNA: FLJ20859 fis, clone 
ADKA01617. 


2426 


100 


1850 


gi 134771 
83 


Homo sapiens 


Similar to hypothetical protein 
FLJ20859, clone MGC:12940 
IMAGE:2822127 5 mRNA, 
complete cds. 


2357 


98 


1850 


gi!28308 
10 


Homo sapiens 


false p73 target protein gene, 
complete cds. 


2239 


99- 


1851 


AAM243 
67 


Homo sapiens 


HYSE- Human EST encoded 
protein SEQ ID NO: 1892, 


546 


100 


1851 


AAY275 
76 


Homo sapiens 


HUMA- Human secreted protein 
encoded by gene No. 10. 


394 


96 


1851 


gi30478 


Homo sapiens 


Human mRNA for dopamine 
beta-hydroxylase type b (EC 
1.14.17.1). 


68 


26 


1852 


gi!655l5 
61 


Homo sapiens 


cDNA FLJ31657 fis, clone 
NT2RI2004304, moderately 
similar to Homo sapiens NY- 


2859 


92 
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RJEN-2 antigen mKNA. 






1852 


AAB366 
26 


Homo sapiens 


INCY- Human FLEXHT-48 
protein sequence SEQ ID 
NO:48. 1 


2056 


67 


1852 


gil28034 
69 


Homo sapiens 


high-glucose-regi&ted protein 
8, clone MGC:739& 
IMAGE:3 139250, mRNA, 
complete cds. 


2056 


67 


1853 


gi776873 
6 


Homo sapiens 


genomic DNA, chromosome 
2 lq, section 87/105. 


4306 


99 


1853 


gi 142457 
29 


Homo sapiens 


ANKRD3 mRNAfor dual- 
specificity Ser/Tlir/Tyr kinase, 
complete cds. 


4003 


94 


1853 


gi988671 
1 


Homo sapiens 


mRNA for protein kinase (dik 
gene). 


3999 


94 


1854 


gil04391 
29 


Homo sapiens 


cDNA: FLJ22655 fis, clone 
HSI07590. 


960 


too 


1854 


AAG014 
59 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 5540. 


668 


100 


1854 


AAU173 
65 


Homo sapiens 


HUM A- Novel signal 
transduction pathway protein* 
Seq ID 930. 


523 


too 


1855 


gi!00471 
85 


Homo sapiens 


mRNA for KIAA1560 protein, 
partial cds. 


3397 


100 


1855 


gi!75124 
95 


Mus musculus 


glycerol-3-phosphate 
acyltransferase, mitochondrial 


3371 


93 


1855 


gi 193367 


Mus musculus 


glyceroI-3-phosphate 
acyltransferase 


3363 


93 


1856 


ABB1223 
6 


Homo sapiens 


HYSE- Human eppin-1 
homologue, SEQ IDNO:2606. 


472 


100 


1856 


giJ39373 
34 


Homo sapiens 


Human DNA sequence from 
clone RP3~46lP17on 
chromosome 20q 12-13,2. 
Contains two novel genes, gene 
HE4 for Major Epididymis- 
specific protein E4 precursor 
(Epididymis Secretory protein 
E4), RPL5 (60S Ribosomal 
Protein L5), COX6C 
(Cytochrome C Oxidase subunit 
VIC) and HSPDI(HSP60, 
Mitochondrial Matrix Protein PI 
precursor, Heat Shock Protein 
60, GROEL, HUCHA60) 
pseudogenes, the SPINT3 gene 
for Kunitz type serine protease 
inhibitor 3 (HKIB9), two genes 
for novel Kunitz/Bovine 
pancreatic trypsin inhibitor and 
WAP-type (Whey Acidic 
Protein) 'four-disulfide core' 
domains containing proteins and 
the gene for Eppin-1, -2 and -3. 
Contains ESTs, STSs, GSSs and 
a CpG island, complete 
sequence. 


415 


98 

- 


1856 


gil359l7 

53 


Oryctolagus 
cuniculus 


eppin 


257 


66 
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iu entity 


1857 


$128558 


Mus musculus 


putative 


758 


57 


1857 


^77366 


Homo sapiens 


Human DNA sequence from 
clone RP3-341D10 on 
chromosome X Contains a gene 
for a novel orotein n&rt of the 
gene for a protein similar to 
ADP rtbosylation factor 3, part 
of a gene similar to HTF9C and 
a CpO island, complete 
sequence. 


424 


100 


1857 


AAM389 
58 


Homo sapiens 


HYSE- Human Dolvoentide SEO 
ID NO 2103. 


421 




1858 


gi 104395 
20 


Homo sapiens 


cDNA: FLJ22973 fis, clone 
KAT11042. 


2289 


100 


1858 


gill 6123 
88 


Homo sapiens 


zinc finger transcription factor 


2279 


99 


1858 


AAU161 
42 


Homo sapiens 


HUMA- Human novel secreted 

nrotein fipn TO t OOS 


1745 


98 


1859 


gi 135440 
26 


Homo sapiens 


putative zinc finger protein from 
EUROIMAGE 566589, clone 
MGO 111 09 TMAGF-^9S94^6 

mRNA, complete cds. 


1481 


99 


1859 


AAB939 
32 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 13929. 


1474 


99 


1859 


ei 104336 
47 


T-fninn •wmipfw 


MAMMA1001105, moderately 
similai- to OVO PROTEIN 


14.74 


yy 


1860 


AA0786 
15 


Homo sapiens 


SHAN- Human zinc finger 
transCrintion factor BtoZFTF45 


1760 


89 


1860 


gi 136234 
31 


Homo sapiens 


clone MGC:I3132 
IMAGE T 4 124255 mRNA 
complete cds. 


1753 


100 


1860 


gil31!18 
97 


Homo sapiens 


Similar to ICIAA041 4 protein, 
clone MGC:2629 
IMAGE:3503643, mRNA, 
complete cds. 


1753 


100 


1861 


AAB622 
01 


Homo sapiens 


RIGE- Cell cycle protein Radh- 
isoform 1. 


3697 


99 


1861 


gi8l6380 
4 


Mus musculus 


putative repair and 
recombination helicase RAD26L 


3215 


89 


1861 


AAB622 
02 


Homo sapiens 


RIGE- Cell cycle protein Radli^ 
isoform 2. 


2142 


99 


1862 


gi259856 
5 


Mus musculus 


rabi9 


1008 


88 


1862 


AAM789 
77 


Homo sapiens 


HYSE- Human protein SEQ IP 
NO 1639. 


603 


56- 


1862 


gI729512 
7 


Drosophila 
melanogaster 


Rab-RP3 gene product 


523 


55 


1863 


gil73840 
67 * 


Homo sapiens 


Human DNA sequence from 
clone RP1M46P21 on 
chromosome 10 Contains the 
3'end of a novel gene* a novel 
gene, the 5'end of the gene for 
KIAA0608 and a CpG island, 
complete sequence. 


2467 


99 


1863 


gi 140399 


Cricetulus 


hypothetical protein 1-2 


2311 


92 
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64 


griseus 








1863 


AAB951 
96 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17284. 


2183 


99 


1864 


gi 126539 
87 


Homo sapiens 


cutaneous T-cell lymphoma 
tumor antigen se70-2, clone 
MGC:5291 IMAGE :345 1565, 
mRNA, complete cds. 


1237 


99 


1864 


gil 13856 

62 


Homo sapiens 


CTCL tumor antigen se70-2 
mRNA, complete cds. 


1237 


99 


1864 


gil 04369 
25 


Homo sapiens 


cDNA: FLJ20957 fls, clone 
ADSE02053. 


1233 


99 


1865 


gil04399 
11 


Homo sapiens 


cDNA: FLJ23282 fis> clone 
HEP07626. 


2579 


100 


1865 


AAY996 
53 


Homo sapiens 


INCY- Human GTPase 
associated protein-4* 


2544 


100 


1865 


gi!0l671 
2 


Rattus 
norvegicus 


Fos-related antigen 


2256 


84 


1866 


AAB530 
73 


Homo sapiens 


GETH Human anglogenesis- 
associated protein PR0195, SEQ 
IDN0:46. 


331 


100 


1866 


AAB884 
28 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone 
PSEC0203. 


331 


100 


1866 


AAU123 
07 


Homo sapiens 


GETH Human PRO 195 
polypeptide sequence. 


331 


100 


1867 


AAB941 
97 


Homo sapiens 


HELI- Human protein sequence 
SEQ1DN0:1453Z 


4854 


99 


1867 


gil 04342 
43 


Homo sapiens 


cDNA FLJ12634 fis, clone 
NT2RM4001858, weakly similar 
to T-BOX CONTAINING 
PROTErN TOX6L. 


4854 


99 


1867 


gi669260 
7 


Mus musculus 


MGA protein 


4810 


84 


1868 


AAB957 
79 


Homo sapiens 


HELI- Human protein sequence 
SEQEDNO:18726. 


1326 


100 


1868 


AAB428 
78 


Homo sapiens 


CURA- Human ORFX 
ORF2642 polypeptide sequence 
SEQIDNO:5284. 


1326 


100 


1868 


gil 04366 
25 


Homo sapiens 


cDNA FLJ14220 fis, clone 
NT2RP3003828. 


1326 


100 


1869 


AAY108 
23 


Homo sapiens 


HUMA- Amino acid sequence 
of a human secreted protein. 


183 


100 


1869 


gil 17158 
4 


Plasmodium 
falciparum 


red algal chloroplast 


72 


29 


1869 


gI630468 


Plasmodium 
falciparum 


hypothetical protein 470 - 
Plasmodium falciparum > 


72 


29 


1870 


gi 143367 
13 


Homo sapiens 


16pl3.3 sequence section 3 of 8. 


1867 


100 


1870 


gi591245 
9 


Homo sapiens 


Human DNA sequence from 
clone LA16-380A1 on 
chromosome 16 Contains two 
novel genes, ESTs, an STS, 
GSSs and five putative CpG 
islands, complete sequence. 


1106 


100 


1870 


gil 39052 
32 


Mus musculus 


Unknown (protein for 
IMAGE:360ll86) 


145 


28 


1871 


AAB433 


Homo sapiens 


CURA- Human ORFX 


2463 


97 
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82 




ORF3146 polypeptide sequence 
SEQ ID NO:6292. 




1 


1871 


AAW734 
00 


Homo sapiens 


HUMA- Human secreted protein 
encoded by Gene No. 4. 


2454 


!()| 


1871 


AAB583 
40 


Homo sapiens 


ROSE/ Lung cancer associated 
polypeptide sequence SEQ ID 
678. 


2252 


908 


1872 


AAB569 
24 


Homo sapiens 


ROSE/ Human prostate cancer 
antigen protein sequence SEQ 
ID m A 502. 


2048 


95 


1872 


AAY599 
93 


Homo sapiens 


META- Human endometrium 
tumour EST encodedprotein 53. 


2048 


95 


1872 


gi476122 
3 


Homo sapiens 


NADP+~dependent isocttrate 
dehydrogenase (PICD) mRNA, 
complete cds* 


2048 


95 


1873 


AAB643 
73 


Homo sapiens 


INCY- Amino acid sequence of 
human intracellular signalling 
molecule INTRA5. 


1875 


too 


1873 


AAB949 
37 


Homo sapiens 


HELI- Human protein sequence 
SEQIDNO:16430. 


1546 


99 


1873 


giI52919 
19 


Drosophila 
melanogaster 


LD3l969p 


1237 


41 


1874 


AAB643 
73 


Homo sapiens 


INCY- Amino acid sequence of 
human intracellular signalling 
molecule INTRA5. 


1836 


92 


1874 


AAB949 
37 


Homo sapiens 


HELI- Human protein sequence 
SEQ IDNO:16430. 


1546 


99 


1874 


gil52919 
39 


Drosophila 
melanogaster 


LD31969p 


1209 


39 


1875 


gi 136235 
1 1 


Homo sapiens 


clone MGC:12921 
IMAGE;41 29897, mRNA, 
complete cds. 


590 


100 


1875 


gil26982 
16 


Macaca 
fascicularis 


hypothetical protein 


589 


99 


1875 


AAM936 
16 


Homo sapiens 


JffiLI- Human polypeptide, SEQ 
ID NO: 3444. 


584 


99 


1876 


gi606368 
8 


Homo sapiens 


ammecrl gene, exon 1 and 
joined CDS (alternative 
transcripts). 


968 


56 


1876 


gi513948 

2 


Homo sapiens 


mRNA for AMMECR1 protein. 


968 


56 


1876 


gi889465 
7 


Mas musouJus 


AMMECRl 


964 


53 


1877 


gil04402 

1 o 

IS 


Homo sapiens 


cDNA: FLJ23506 fis, clone 
LNG03055. 


2913 


99 


1877 


AAY733 


Homo sapiens 


INCY- HTRM clone 2762174 
protein sequence. 


2110 


100 


1877 


AAU162 
62 


Homo sapiens 


HUMA- Human novel secreted 
protein, ScqlD 1215. 


1286 


98 


1878 


AAG671 
51 


Homo sapiens 


INCY- Amino acid sequonco of 
a human enzyme. 


1689 


99 

<* 


1878 


gil28562 
10 


Mus musculus 


putative 


1457 


85 


1878 


gi312821 
8 


Arabidopsis 
thatiana 


putative katanin 


874 


56 


1879 


AAB600 

93 


Homo sapiens 


INCY* Human transport protein 
TPPT-13. 


3295 


95 
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1879 


gi 172237 
24 


Homo sapiens 


sodium/glucose cotransporter ^ 
KST1 mRNA, complete cds. | 


3295 


95 


1879 


AAF8402 
3 aal 


Homo sapiens 


LEX1- Novel human transport^ 
protein (NHP) encoding DNA.B 


3289 


95 


1880 


AAB60O 
93 


Homo sapiens 


INC Y- Human transport proteif 
1PPT-13. * 


3461 


99 


1880 


gil72237 
24 


Homo sapiens 


sodium/glucose cotransporter 
KST1 mRNA, complete cds. 


3461 


99 


1880 


AAF8402 
3 aal 


Homo sapiens 


LEXI- Novel human transporter 
protein (NHP) encoding DNA. 


3455 


99 


1881 


gi 128454 
75 


Mus muse ul us 


putative 


1648 


70 


1881 


gi!35433 
07 


Homo sapiens 


cargo selection protein 
(man nose 6 phosphate receptor 
binding protein), clone 
MGCrll 1 17 IMAGE:38334U, 
tnRNA ? complete cds. 


725 


39 


1 SRI 


AAY672 
40 


Homo sapiens 


INCY- Human adipophilin-like 
protein (HALP) amino acid 
sequence. 


724 




1882 


AAW469 
04 


Homo sapiens 


AS AH A human mutant alanine 
aminotransferase. 


I Oil 




1882 


gil76309 
6 


Homo sapiens 


Human glutamate pyruvate 
transaminase (GPT) gene, 
complete cds. 


1821 




1882 


gil73904 
65 


Homo sapiens 


glutamic-pyruvate transaminase 
(alanine aminotransferase), clone 
MGC:17068 IM AGE :4 179699, 
mRNA, complete cds. 


1821 


68 


1883 


AAD094 
95_aal 


Homo sapiens 


SUGE- Human SGP003 
phosphatase polypeptide 
encoding DNA. 


1161 


too 


1883 


AAE0483 
7 


Homo sapiens 


SUGE- Human SGP003 
phosphatase polypeptide. 


1159 


99 


1883 


AAB186 

67 


Homo sapiens 


INCY- A human regulator of 
intracellular phosphorylation. 


1021 


91 


1884 


gi308942 
7 


Homo sapiens 


SSC6 rearranged T cell receptor 
beta chain (TCRBVI7) gene, 
complete cds. 


1089 


69 


1884 


gi300292 
7 


Homo sapiens 


Tcell receptor beta chain 
(TCRBV1 7S 1 -TCRB J 1 S5) 
mRNA, complete cds. 


1089 


69 


1884 


gi298250 
8 


Homo sapiens 


mRNA for TCR beta chain, 
specific for Mage 3/HLA-A2. 


1089 


70 


1885 


gil28582 
21 


Mus musculus 


putative 


1850 


91 


1885 


gil23141 
02 


Homo sapiens 


Human DNA sequence from 
clone RP3-329L24 on 
chromosome 6q22. 1-22.33 
Contains a gene for a novel 
protein, part of a gene for a 
hypothetical 23.0 KD protein, 
part of a gene for a protein 
(MCM2/3/5 family), ESTs, 
$T$s, GSSs and a CpG island, 
complete sequence. 


1405 


100 


1885 


gi438883 


Arabidopsis 


putative DNA replication 


726 


41 
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2 


thaliana 9 


licensing factor 






lOOU 


00 


OUIIIU pap* till* i 


clnnf TMAGT 3 -^354344 

mRNA, partial cds. 


3001 


99 


1 CCA 
loiSO 


A A T3 0/tl 

86 


LTAmn cantsnct Ml 
rlUn 1 U bapiblla BJ 


WPWR* PTnmfltt iipurnl c.p,\\ 

protein marker RR/B. 


673 


29 


1 ftft/t 
loftu 


gij/oj/jo 
2 


Mnmn ecinipffcc V 


prtnrlprm-npurnl CfjftPY«1 hfrttftifi 

(ENC-1) mRNA, complete cds. 


673 


29 




gt 1UU4/Z 

39 


riomo Sapiens 


partial cds. 


7698 


96 


loo/ 


gl lOJjliS 

20 


Homo sapiens 


rHMA PI m fid rlfinp 

NT2RP700123L 


3076 


100 


1887 


gi79591o 
1 


Homo sapiens 


mKJNA lor jviA/\i40U protein, 
partial cds. 






1 ooo 


39 


Homo sapiens 


mDM A fnr VTA A 1 1ft'? nrntp»in 
JTIKFVA IOr JY1AA J JoZ piOlLJU, 

partial cds. 


/ oVv 




1888 


gi 16551 8 
20 


Homo sapiens 


cuinA rLjj 1 ojy us, clone 
MT2RP7001231, 


7RTS 




1888 


gi795918 
1 


Homo sapiens 


mRNA lor jV1AA140U protein, 
partial cds. 






1889 


gi 100472 
39 


Homo sapiens 


mRNA for KIAA1582 protein, 
partial cds. 


7372 


95 


1889 


gi795918 
I 


Homo sapiens 


mRNA for KIAA1460 protein, 
partial cds. 


2618 


44 


1889 


gil65518 
20 


Homo sapiens 


c0NAFLJ31859fis, clone 
NT2RP700123K 


2354 


81 


1890 


gi 163072 
85 


Homo sapiens 


Clone IMAGE:3877337, 
mRNA, partial cds. 


1627 


98 


1890 


gil 52080 
51 


Macaca 
fascicularis 


hypothetical protein 


1417 


55 


1890 


AAY949 
18 


Homo sapiens 


GEMY Human secreted protein 
clone dd504_18 protein 
sequence oeaj iu tNu/fz. 


lJ.*f ( 




1891 


gi'278041 
4 


Homo sapiens 


hBACH mRNA for brain acyl- 
loa nyuruiase, compieic i>ub* 


1742 


100 


1891 


gil69243 
33 


Homo sapiens 


cytosolic acyl coenzyme A 
thioester hydrolase, clone 
MGC:1 126 IMAGE'3507488, 
miviNA ? complete wu:>. 


1742 


100 


1891 


AAW748 


Homo sapiens 


HUMA- Human secreted protein 

i*nnrtAp>r\ Kv npnp I AO fifing 

ciu>uuuu uy g,cuc ivy uiviic 
HPTTU1 1. 


1715 


98 




AADOOtS 

74 


noiuO sapiens 


TMPY- Human RFPAP 

polypeptide, SEQ IE) NO: 4, 


1266 


100 




A A V04.R 

90 




PttOT- Human nmtein clone 
HP02798. 


1266 


100 


1892 


gil 44245 
18 


Homo sapiens 


Similar to R1KEN cDNA 
I190004A01 gene, clone 
MGC:13I53 IMAGE:4302257, 
mRNA, complete cds. 


1266 


100 


1893 


gil45950 
19 


Homo sapiens 


mRNA for keratin 6 irs 
(KRT6IRS gene). 


843 


77 


1893 


gi609207 
5 


Mus musculus 


type II cytokeratin 


836 


74 


1893 


giI28345 
35 


Mus musculus 


putative 


836 


74 


1894 


gil40291 
53 


Homo sapiens 


microtubule-associated protein 
GLFND mRNA, complete cds. 


2565 


100 
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1894 


gil31119 
07 


Homo sapiens 


clone N/IOC'^^n 

IMAGE:3502614,mRNA, 
complete ed^ 


ZjOj 


Wi) 


) 1894 
) 


gil 34477 
59 


Homo sapiens 


fibronectin type 3 and SPRY 

□OlYlfiitt-CfinfrnninQ nrntf>Iti 1 

VVH11U11J11 Ig iJiv/ivltl l 

(FSD1 ) mRNA, complete cds. 


2562 


99 


1895 


gil 44245 
91 


Homo sapiens 


hypothetical protein FLJ22127, 

clone MGC-idQ?f* 
IMAGE:4123948, mRNA, 
complete cds. 


2565 


100 


1895 


gil04380 
86 


Homo sapiens 


cDNA: FLJ21886 fis, clone 
HEP03022. 


2555 


99 


1895 


gil04384 
00 


Homo sapiens 


cDNA: FLJ22127 fis, clone 
HEP19530. 


1566 


100 


1896 


gi!5l503 
06 


Homo sapiens 


glycogenin-interacting protein 3 
mRNA, complete cds. 


1571 


100 


1896 


gil5!503 
04 


Homo sapiens 


glycogenin-interacting protein 2 
mRNA, complete cds, 
alternatively spliced. 


1571 


100 


1896 


gil5l502 
98 


Homo sapiens 


glycogenin-interacting protein 1 
mRNA, complete cds. 


1571 


100 



TABLE 3 



SEQID 
NO: 


Database 
entry ID 


Description 


Results* 


950 


BL00972 


Ubiquitin carboxy 1-terminal hydrolases 
family 2 proteins. 


BL00972A IL93 2.500e-20 267- 
285 BL00972D 22.55 5.l79e-17 
828-853 BL00972E 20,72 8.650e~ 
13 855-877 BL00972C 16.48 
7.120e-ll 411-426 BL00972B 9.45 
7.923e-10 353-363 


950 


PR00833 


POLLEN ALLERGEN POA PI 
SIGNATURE 


PR00833H 230 8.000e-10 2-17 


950 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308A 5.90 7.671 e-09 5-20 
PR00308A 5.90 9.471e-09 4-19 


951 


BL00018 


EF«hand calcium-binding domain proteins. 


BL00018 7.41 5.696e-09 331-344 


951 


PR00I80 


CELLULAR RETINALDEHYDE- 
BINDING PROTEIN SIGNATURE 


PR00I80C 10.92 8. 821 e-09 70-92 


952 


BL00018 


EF-hand calcium-binding domain proteins. 


BL00018 7.4I 5.696e-09 33 1-344 


952 


PR00I80 


CELLULAR RETINALDEHYDE- 
BINDING PROTEIN SIGNATURE 


PR00180C 10,92 8.82Ie-09 70-92 


953 


PR00401 


SH2 DOMAIN SIGNATURE 


PR00401B 12,94 LOOOe-08 340- 
351 


954 


PR00401 


SH2 DOMAIN SIGNATURE 


PR00401B 12.94 1.000e-08 367- 
378 


955 


BL00625 


Regulator of chromosome condensation 
(RCC1) proteins. 


BL00625A 16.21 7.787e-I6 308- 
337 BL00625A 16.21 7,369e-15 
190-219 BL00625B 17.69 1.514e- 
13 302-336 BL00625B 17.69 
2.286e-13 184-218 BL00625B 
17.69 3.957e-13 132-166 
BL00625A 16.21 5.690e-13 138- 
167 BL00625A 16.21 5,73 le-H 
360-389 BL00625B 17.69 3.333e~ 
10 354-388 
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SEQ ID 
NO: 


Database 
entry ID 


Description 




955 


PR00633 


CHROMOSOME CONDENSATION 
REGULATOR RCC1 SIGNATURE 


PR00633A 9 32 6 143e-09 202-219 
PRO0633H 15.10 6.268e-09 196- 
218 PR00633F 10.03 6,354e~09 
373-388 PR00633G 13.71 7.556e- 
09 190-209 


957 


BL00120 


Lipases, serine proteins. 


BL00120B 1 L37 9.486e-12 166- 
181 


957 


PR00111 


ALPHA/BETA HYDROLASE FOLD 
SIGNATURE 


PR00111B 10.61 L176e-09 170- 
184 


963 


BLO0183 


Ubiquitin-conjugating enzymes proteins. 


BLO0183 28.97 1.329e-10 45-93 


966 


PR00049 


WILM'S TUMOUR PROTEIN 


PR00049D 0.00 6.034e-09 262-277 


967 


PRO0O49 


WILM'S TUMOUR PROTEIN 
SIGNATURE 


PR00049D 0.00 6.034e-09 74-89 


968 


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790D 12.41 8,297e-09 804- 
829 


969 


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790D 12.41 8.297e~09 878- 
903 


971 


PR00380 


KINESEN HEAVY CHAIN SIGNATURE 


PR00380D 9.93 2.080e-22 321-343 
PR00380A 14.18 L486e-21 79-101 
PR00380B 12.64 6.57te-18 217- 
ZJ5 rKOUiSUC 13.18 0.9276-13 
269-288 


Q71 


TH 0(141 1 


cvincaiu nivuur uuiimiri |Jiuu-.iua, 


DLAJ\J t fl ir% I J.OO o.^vl/e-OU jZU" 

351 BL00411G 21.39 6*100e-28 
270-312 BL00411C 15.04 7.000e- 
22 79-101 BL0041 IF 14.77 
1 .273e- 1 9 208-233 BL004 1 1 E 
10.43 7.429e-l 2 142-161 
BL0041 1A 11.31 4.484e-ll 9-24 
BL00411B 13.51 L563e-10 45-62 


971 


PD00301 


PROTEIN REPEAT MUSCLE 
CALC1UM-BI. 


PD0030I A 10.24 6.400e-09 598- 
609 


971 


DM01399 


VARICELLA-ZOSTER VIRUS GENE 54 
PROTEIN. 


DM01399B 12.42 7*092e-09 1571- 
1583 



* Results include in order: accession number subtype; raw score; p- value; position of signature in amino add 



SEQ ID 
NO: 


Database 
entry ID 


Description 


Results* 


971 


BL00678 


Trp-Asp (WD) repeat proteins proteins. 


BL00678 9.67 8,800e-10 1350- 
1361 BL00678 9.67 7.158e-09 
1629A640 


971 


BL00502 


Polygalacturonase proteins. 


BL00502A 13.44 7.341e-09 1424- 
1439 


971 


BL00412 


Neuromodulin (GAP-43) proteins. 


BL00412D 16.54 7.796e-09 568- 
619 


971 


BL00226 


Intermediate filaments proteins. 


BL00226B 23.86 8.012e-09 930- 
978 


971 


PR00320 


G-PROTEIN BETA WD^40 REPEAT 
SIGNATURE 


PR00320B 12,l9 2.385e^l2 1348- 
1363 PR00320C 13.01 L720e-t0 
1348-1363 PR00320A 16.74 
4.973e-10 3348-1363 PR00320B 
12.19 5.886e-10 1544-1559 
PR00320A 16.74 3.415e-09 1544- 
1559 PR00320C 13.01 5.500e-09 
1498-1513 PR00320B 12.19 
8.650e-09 1627-1642 PR00320C 
13.01 9. t00e-09 1627-1642 
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seq m 

NO: 


Database 
entry ID 


Description 


Results* 


973 


BL0I242 


Pormamidopyrimidine-DNA glycosylase 
proteins. 


BL01242I^17.92 5.300e-lI 32-66 


975 


PF0065I 


BTB (also known as BR-C/Ttk) domain 
proteins. 


PF00651 lLo0 2.500e-14 46-59 


975 


PR00501 


KELCH REPEAT SIGNATURE 


PR00501 AS.25 Z750e-1 1 371- 
385 V 


976 


DM00215 


PROL1NE-RICH PROTEIN 3. 


DM002 1 5 19.43 3.898e-09 99-132 


977 


PRO038O 


KINESIN HEAVY CHAIN SIGNATURE 


X IWJKJiJ OUil 14.1 O ^.ZJuC'ZJ i*J m 

115 PR00380D9,931857e49 
302-324 PR0038OB 12 64 4 420e- 
18 212-230 PR00380C 13.18 
1 692e4 6 247-266 


977 


BL0041 1 


Kinesin motor domain proteins. 


BL004 1 1G 2 1 39 7 750e-32 248- 
290 BL00411F 14.77 1.000e-25 
203-228 BL00411C 15 04 1 621 e- 
24 93-115 BL0041IH 15.66 
L871e-24 301-332 BL0041IE 
10,43 6.625e-20 143^162 
BL00411A 11.31 4.484e-ll 5-20 


977 


PF00846 


Hantavirus nucleocapsid protein. 


PF00846H 3.96 9.182e-i0 408- 
445 


977 


BL00224 


Clathrin light chain proteins. 


BL00224B 16.94 7.136e-09 363- 
416 


978 


BL00I07 


Protein kinases ATP-binding region proteins. 


BL00107A 18.39 3.368e-18 36-67 


978 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 2.068e-10 36-55 


980 


BL00594 


Aromatic amino acids permeases proteins. 


BL00594A 16.75 9.376e-09 76- 
120 


982 


BL00790 


Receptor tyrosine kinase class V proteins- 


BL00790E 29.58 LIlle-12 614- 
662 BL00790E 29 58 1 1 1 1 1 2 
668-716 BL00790E 29.58 7.00Qe- 
10 560-608 


982 


BL00279 


Membrane attack complex components / 
perforin proteins. 


BL00279E 17 11 7 6T>p-19 727- 
775 BL00279E 37. 1 1 2.047e-09 
765-813 


982 


BL01187 


Calcium-binding EGF-like domain proteins 
pattern proteins. 


BL01187B 12 044 600e-ll 791- 
807 BL01187B 12.04 3.57Ie- 10 
829-845 BL01 187B 12.04 8.714e- 
10 753-769 BL01187A9.98 
4.3756-09 812-824 BL01187A 
9,98 5,125e-09 774-786 


982 


PR00343 


SELECTIN SUPERFAM1LY 
COMPLEMENT-BINDING REPEAT 
SIGNATURE 


PR00343C 16.85 5.364e-09 13-32 


982 


PR00764 


COMPLEMENT C9 SIGNATURE 


PR00764F 16.89 8.027e-10 744- 
765 PR00764F 16.89 6.844e-09 
782-803 


982 


PR00OIO 


TYPE II EGF-LIKE SIGNATURE 


PR00010A 1179 6.192e-n 814- 
826 PR00010C 11.16 5.909e- 10 
758-769 PR00010A 13,79 7,677e- 
10 776-788 PROfiOlOC 11.16 

I. 214e-09 796-807 PR00010C 

II. 16 8.286e-09 834-845 


986 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 4.l50e-12 126- 
149 


987 


PR00019 


LEUCINE^RICH REPEAT SIGNATURE 


PR00019A 11.19 6.087e-l0 93- 
107 PR00019B 1L36 7.840e-09 
90-104 
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Database 
entry ID 


Description 


Results* 


990 


PD01443 


INHIBITOR CALPAIN CfcLPASTATIN 
REPEAT THIOL PROT, 


PD01443D 836 4,670e-09 815- 
837 


991 


BL01101 


Casein kinase I! regulator)|Bubunit proteins. 


BL01101A 16.07 L000e-40 9-S4 
BL01101B 10.94 9,000e-31 72-97 


991 


PR00472 


CASEIN KINASE 11 REGULATORY 
SUBUNIT FAMILY SIGl^TURE 


PR00472C 12.38 5.154e-28 80- 
102 PR00472A 8.03 7.600c-23 8- 
25 PR00472B 14.84 L000e-19 
25-40 


994 


BL0U66 


RNA polymerases beta chain proteins. 


BL01 1 66G 18.10 2.500e-34 824- 
866 BL01166H 19.05 9.4l0e-30 
936-986 BL01166D 17.37 
4,396e-l 9 612-642 BL0H66E 
13.47 8.244e-17 682-706 
BL0II66C 12.21 9.357e-12 431- 
456 


995 


BL00518 


Zinc finger, C3HC4 type (RING finger), 
proteins. 


BL00518 12.23 7.000e-09 25-34 


996 


BL0003S 


Myotype^ 'helix-loop-helix 1 dimerization 
domain proteins. 


BL00038A 13,61 6.625e-ll 284- 
300 


996 


DM00984 


w MYOD MYOBLAST DETERMINATION 
SHORT. 


DM00984B 15.18 3.901 e-09 262- 
317 


996 


BL0I009 


Extracellular proteins SCP/Tpx-I/Ag5/PR- 
1/Sc7 proteins. 


BL01009A 13,75 7.750e-09 260- 
278 


997 


BL00905 


GTP1/OBG family proteins. 


BL00905D 15.00 4,214e~10 125- 
140 


997 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449C 17.27 3.903e-13 63-86 
PR00449A 13.20 7.750e-l0 22-44 


997 


BL01019 


ADP-ribosylation factors family proteins. 


BL01019A 13,20 8,624e-10 56-96 


997 


BL011I5 


GTP-binding nuclear protein ran proteins. 


BL01I15B I0.S1 1.505Q-09 102- 
146 


998 


BL00107 


Protein kinases ATP-binding region proteins. 


VkT Art i ath i *\ ii n inn. -t c 7* A Art 

BL00107B 1 3.3 1 7.300e-l5 64-80 


998 


BLO0239 


Receptor tyrosine kinase class II proteins. 


BL00239E 17.14 2.693e-l0 36-86 


1001 


BL00383 


Tyrosine specific protein phosphatases 
proteins. 


BL00383E 10.35 5.263e-09 328- 
339 


1003 


BL00383 


Tyrosine specific protein phosphatases 
proteins. 


BL00383E 10.35 5.263 c-09 246- 
257 


1004 


BL00383 


Tyrosine specific protein phosphatases 
proteins* 


BL00383E 10.35 5.263e-09 328- 
339 


1010 


BL00226 


Intermediate filaments proteins. 


BLUU226r> 2i.oo y t ylye-[)y Dou- 
608 


1012 


PR00322 


G10 PROTEIN SIGNATURE 


PR00322E 6.62 1.720e-10 30-40 


1012 


BL00997 


G 10 protein. 


BL0U997L 6.36 i.30oe-09 29-39 


1013 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 6.786e-26 HO- 
139 BL00269A 8.53 2.607e-20 
45-65 BL00269B 19,17 S.500e- 17 

//.-Wl 


1014 


BL0H60 


Kinesin light chain repeat proteins. 


BL01160B 19.54 8,297e^0 6-60 


1015 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 5.846e-U 476- 
493 BL00028 16,07 6.192e-ll 
989-1006 


1015 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 6.087e-10 473- 
487 PR00048A 10.52 1.000e-09 
9864000 


1015 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDL 


PD00066 13.92 6.571e-12 464- 
477 PD00066 13.92 7.000e- 12 
977-990 ?D00066 13.92 4.600e- 
09 949-962 PD00066 13.92 
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Database 
entry ID 
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Results 1 * 






| - - ■ 


7.300e-09 397-410 


1019 


BL00289 


J^ntaxin family proteins. 


BL00289E 18.004.375e-13 22-37 


1019 


PR00895 


Ipentaxin signature 


PR00895G 14.55 4.913e-10 19-31 


1022 


BL00348 


1153 tumor antigen proteins. 


BL00348F 23.19 4.57le-09 140- 
183 


1023 


BL00455 


Mutative AMP-binding domain proteins. 


BL00455 13.31 6.684e^l3 248- 
264 


1023 


PR00154 


AMP-BINDING SIGNATURE 


PR00154A 8.88 7.375e-10 241- 
253 


1026 


BL0042L 


Transmembrane 4 family proteins. 


BL00421E 20.97 1.851e-09 17-47 


1026 


PR00259 


TRANSMEMBRANE FOUR FAMILY 
SIGNATURE 


PR00259D 13.50 7.097e-09 20*47 


1028 


PD01066 


PROTEIN ZINC FINGER ZLNC-FINGER 
METAL-BINDING NU. 


PD0 1 066 1 9.43 2. 1 49e-29 6-45 


1028 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDL 


PD00066 13.92 1.692e-15 234- 
247 PD00066 13.92 3.400e-14 
150-163 PD00066 13.92 5. 800e- 
14 206-219 PD00066 13.92 
8.714e-12 178-191 


1028 


BL00028 


Zinc ringer, C2H2 type, domain proteins. 


BL00028 16.07 2.350e-13 102- 
119 BL00028 16.07 5.500e-11 
190-207 BL00028 16.07 6,1 92e- 
11 218-235 BL00028 16.07 
1.000e-09 134-151 


1028 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 4 750e-14 215- 
229 PR00048A 10,52 5.500e-I3 
99-113 PR00048A 10.52 4.3l6e- 
11 131-145 PR00048B6.02 
4.462e-II 115-125 PR00048A 
10.52 1.391e-IO 159-173 
PR00048B 6.02 6.625e-I0 203- 
213 PR00048A 10,52 4.600e-09 
187-201 PR00048B 6.02 8.579e- 
09 147-157 


1029 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014D 12.04 2.059e-10 215- 
230 


1029 


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790F 15.90 2.519e-09 157- 
184 


1032 


BL00269 


Mammalian defensins proteins. 


BL00269C 16.52 6.786e-26 133- 
162 BL00269A 8.53 2.607e-20 
68-88 BL00269B 19.17 5.500e-17 
95-124 


1033 


PD01876 


ANTIGEN MELANOMA-ASSOCIATED 
MULTIOENE FAMILY TUM. 


PD01876C 21.73 1.231e-20 75» 
128 


1034 


PD02870 


RECEPTOR INTERLEUKIN- 1 
PRECURSOR. 


PD02870B 18.83 8.835e^ll 326- 
359 


1034 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 
* 


PR00019A 11.19 4.150e-ll 112- 
126 PR00019B 11.36 8.000e-ll 
109-123 PR00019B il.36 8.500e- 
11 184^198 PR00019A 11.19 
6.478e-10 187-201 PR00019A 
H.19 7.333e-09 63-77 


1034 


DM00179 


w KINASE ALPHA ADHESION T-CELL, 


DM00179 13.97 9.609e-09 334- 
344 


1035 


PR00988 


URIDINE KINASE SIGNATURE 


PR00988A 6.39 6.276e-12 358- 
376 


1035 


BL00567 


Phosphoribulokinase proteins. 


BL00567A 10.66 6.459e-ll 360- 
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379 


1038 


BL00I20 


Lipases, serine proteins. 


BL00120C 12.62 9.053e^l2 95- 
106 


1038 


PR00825 


VESPID VENOM ALLERGEN 
PHOSPHOLIPASE Al SIGNATURE 


PR00825B 14.81 L371e~10 83- 
104 


1038 


PR00821 


TRIACYLGLYCEROL LIPASE FAMILY 
SIGNATURE 


PR0082IE 9.64 4.896e-12 19-38 
PR00821F 16.10 L806e-09 1 18- 
134 


1044 


BL00027 


'Homeobox' domain proteins. 


BL00027 26.43 5.500e-30 169- 
212 


1044 


BL00032 


'Homeobox* antennapedia-type protein. 


BL00032C 1 L28 5.179e-16 197- 
215 BL00032B 10.83 3.060e-15 
158-197 


1044 


PR00024 


HOMEOBOX SIGNATURE 


PR00024C 7.49 8.071e-13 201- 
211 PR00024B 11.27 Z000e-12 
191-202 


1044 


PR00026 


ENGRAILED HOMEODOMAIN 
SIGNATURE 


PR00026A 7.47 9.710e-t2 153- 
171 


1044 


PR00031 


LAMBDA AND OTHER REPRESSOR 
HELIX-TURN-HELIX SIGNATURE 


PR00031B 16.29 4.724e-10 192- 
209 


1044 


PR00025 


HOMEOTIC ANTENNAPEDIA PROTEIN 
SIGNATURE 


PR00025B 11.94 6.434e-10 155- 
171 


1048 


BL01128 


Shikimate kinase proteins. 


BL01 128 A 18,84 8.200e-14 7-41 


1048 


BL00211 


ABC transporters family proteins. 


BL0021 1 A 12.23 4.600e-09 9-21 


1049 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014C 15.44 1.783e-09 211- 
230 PR00014A8.22 3.04Se-09 
373-383 PRG0014C 15.44 6.087e- 
09 309-328 


1049 


BL00790 


Receptor tyrosine kinase class V proteins. 


BL007901 20.01 L750e-12 642- 
673 BL007901 20.01 6.125e-12 
320-351 BL007901 20.01 6,679e- 
09 222-253 


1049 


PR00096 


GLUTAMINE AMINOTRANSFERASE 
SUPERFAMILY SIGNATURE 


PR00096B 9,72 9.827e~09 689- 
701 


1050 


DM00372 


CARCINOEMBRYONIC ANTIGEN 
PRECURSOR AMINO-TERMINAL 
DOMAIN. 


DM00372C 23.69 4.919e-12 67- 
103 


1053 


BL01282 


BIR repeat proteins. 


BL01282B30.49 LOO0e-ll 194- 
233 


1053 


PR00544 


PROGESTERONE RECEPTOR 
SIGNATURE 


PR00544F 7.76 3.329e-09 18-35 


1057 


PR00464 


E-CLASS P450 GROUP II SIGNATURE 


PR00464A 20.47 9.591e-16 149- 
170 PR00464C 18.84 l.OOOe-15 
324-353 PR00464D 17.40 6.250e- 
15 353-371 PR00464B 20.41 
1.844e-l 2 205-224 - 


1057 


PR00385 


P450 SUPERFAMILY SIGNATURE 


PR00385A 14.97 1.346e-12 335- 
353 PR00385B 10.22 4.130e-l 1 
353-367 


1057 


PR00463 


E-CLASS P450 GROUP I SIGNATURE 


PR00463E 17.37 4.814e-l 1 344- 
371 


1058 


PD02382 


RECEPTOR CHAIN PRECURSOR 
TRANS ME. 


PD02382A 17.43 9.321e-09 99- 
115 


1060 


BL00795 


Involucrin proteins. 


BL00795C 17,06 6,442e-10 905- 
950 


1060 


BL00422 


Granins proteins. 


BL00422C 16,18 4.255e~10 910- 
938 BL00422C 16.18 4.353c-09 
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913-941 


1060 


PF00992 


Troponin. 


PF00992A 16.67 2.184e-09 900- 
935 PF00992A 16.67 5.382e-09 
889-924 


1060 


BL00412 


Neuromodulin (GAP-43) proteins. 


BL00412D 16.54 7.429e-09 883- 
934 


1060 


PF01140 


Matrix protein (MA), p!5. 


PF01140D 15.54 8.326e-09 903- 
938 


1063 


DM00215 


PROLINE^RICH PROTEIN 3. 


DM00215 19.43 6.644e-09 603- 
636 


1063 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 9.643e-10 96-1 1 1 
PR00049D 0.00 2.525e-09 95-1 10 
PR00049D 0.00 3.898e~09 1 14- 
129 PR00049D 0.00 7.407e-09 
97-112 


1063 


PR00239 


MOIXUSCAN RHODOPSIN C- 
TERMINAL TAIL SIGNATURE 


PR00239E L58 8.670e-09 100^ 
112 


1064 


BL01282 


BIR repeat proteins. 


BL01282B 30.49 1.000e-l 1 137- 
176 


1064 


PR00544 


PROGESTERONE RECEPTOR 
SIGNATURE 


PR00544F 7.76 3.329e-09 18-35 


1065 


BL0I282 


BIR repeat proteins. 


BL01282B 30.49 1.000e-ll 187- 
226 


1065 


PR00544 


PROGESTERONE RECEPTOR 
SIGNATURE 


PR00544F 7J6 3,329e-09 18-35 


1066 


BL00218 


Amino acid permeases proteins. 


BL0021 8D 21.49 7.324e-l 1 226- 
271 BL00218E 23.30 3.475e-09 
307-347 


1067 


BL00994 


Bacterial export FHIPEP family proteins. 


BL00994A 15,15 I.086&-09 71- 
118 


1068 


PD01066 


PROTEIN ZINC FINGER ZINC-FINGER 
METAL-BINDING NU. 


PD01066 19.43 8.385e-33 645 


1068 


BL01030 


RNA polymerases M / 15 Kd subunits 
proteins. 


BL0I030 23.44 7.480e-10 283- 
321 


1068 


BL00466 


TFIIS zinc ribbon domain proteins. 


BL00466 25.88 5.622e-09 283- 
320 


1068 


PDQOQ66 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 2.385e-15 271- 
284 PD00066 13.92 3.077e-15 
243-256 PD00066 13.92 3.077e- 
15 320-333 PD00066 13.92 
3.077c-l 5 348-361 FD00066 
13.92 3.077^15 376-389 
PD00066 13.92 3,077e-15 432- 
445 PD00066 13.92 3.077e-l5 
617-630 PD00066 13.92 3.077e- 
15 701-714 PD00066 13.92 
7.923e-l 5 215-228 PD00066 
13.92 8.200e-l4 589-602 
PD00066 13-92 8.800e-14 729- 
742 PD00066 13.92 5.71 4e- 12 
542-555 PD00066 13,92 9.571e- 
12 561-574 PD00066 13,92 
3.739e-ll 404-417 PD00066 
13.92 L692C-10 299-312 
PD00066 13.92 2.038e-10 488- 
501 PD00066 13.92 S.6l5e-10 
645-658 PD00066 13.92 6.700e- 
09 757-770 
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NO: 


Database 
entry ID 


Description 


Results * 


1068 


BL00028 


Zinc finger, C2H2 type, domain proteitfe. 


BL00028 16.07 5.950e-13 388- 
405 BL00028 16.07 6.400e-13 
444-461 BL0002S 16.07 1.692e- 
11 227-244 BLO0028 16.07 
3.423e-I I 332-349 BL00028 
16.07 6.538e-ll 685-702 
BL00028 16.07 7.23 le- 11 713- 
730 BL00028 16.07 7.577e-ll 
jfj-'jyv oLUuUZo 10.U/ 4.JU0e- 

10 601-618 BLO0028 16.07 
5.500e-10 17M88 BL00028 
16,07 5.800e-I0 255-272 
BL00028 16.07 7.900e-10 657- 
674 BL00Q28 16.07 9.700e- 10 
526-543 BL00028 16.07 2.029e- 
09283-300 BL00028 16.07 
3.829e-09 741-758 BL00028 
16.07 6.914e-09 360-377 
BL00028 16.07 7.686e-09 416- 
433 


1068 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 4.750C-14 570- 
584 PR00048A 10.52 2.929e-l3 
329-343 PR00048A 10.52 5.500e- 
13 224-238 PR00048A 10.52 
1 .000e-l 2 598-6 1 2 PR00048 A 
10,52 5,235e-l 2 252-266 
PR00048A 10.52 5.765e-12 441- 
455 PR00048A 10.52 7.353 e- 12 
385-399 PR00048A 10.52 7.353e- 
12 413-427 PRO0048A 10.52 
7.353e-l 2 710-724 PR00048A 
10.52 3,842^11 357-371 
PR00048A 10.52 5.263e-ll 626- 
640 PR00048A 10.52 5. 737c- 11 
280-294 PR00O48A 10.52 7.632e~ 
11 682-696 PR00048A 10.52 
8.579e-l 1 654-668 PR00048B 

Z. 1 X Jc- 1 V Q-J /-HO / 

PR00048A 10.52 2.565e-10 738- 

523-537 PR00048B 6.02 7.188e*- 
10 698-708 PR00048B6 02 
7.750e-l 0 726-736 PR00048B 
6.02 1.000e-09 240-250 
PR00048A 10,52 3,520e-09 308- 
322 PR00048A 10.52 4.600c-09 
551-565 PRQ0048A10.52 8.560e- 
09 196-210 PR00048B6.02 
1.000e-08 586-596 


1069 


PD01427 


TRANSFERASE METHYLTRANSFERASE 
BI. 


PD01427B 22.45 1.545e-ll 117- 
158 


1070 


PF00168 


C2 domain proteins. * 


PF00168C 27.49 1.750e-Q9 2Q2- 
228 


1070 


PR00360 


C2 DOMAIN SIGNATURE 


PR0O360B 13.61 2.227c-09 219- 
233 


1075 


PR00962 


LETHAL(2) GIANT LARVAE PROTEIN 
SIGNATURE 


PR00962D 10.40 3.054e-10 178- 
202 


1075 


PRD0320 


G-PROTEIN BETA WD-40 REPEAT 


PR00320A I6.74 2.976e-09 181- 



256 



WO 02/070539 



PCT/11S02/0S095 



SEQ ID 
NO: 


Database 
entry ID 


Description 


Results* 






SIGNATURE* 


196 


107S 


DM01970 


0 kw ZK632.12 YDR313C ENDOSOMAL 

in. 1 


DM01970B 8.60 2.478e-13 310- 
323 


1078 


BL00678 


Trp-Asp (WD)ifepeat proteins proteins. 


BL00678 9 67 1 474e-09 41-^7 


1078 


PR00320 


G-PROTEIN EETA WD-40 REPEAT 
SIGNATURE c 


PR00320A 16.74 L931e~U 214- 
229 PROO320B 12.19 3,829e-10 
214-229 PR00320C 13.01 3.880e- 
10 214-229 PR00320C 13.01 
4.900e^09 257-272 


1079 


PF00774 


Dihydropyridine sensitive L-type calcium 
channel (Beta subunt. 


PF00774D 10 59 8 ?9fifiJf)Q i^o. 
365 


1079 


BL01013 


Oxysterol-binding protein family proteins* 


632 


1080 


BL00615 


C-type lectin domain proteins, 


147 


1081 


PD01066 


PROTEIN ZINC FINGER ZINC-FINGER 
METAL-BINDING NU. 


PD01066 19.43 2.705e-l I 47-86 


1082 


BL00359 


Ribosomal protein LI 1 proteins. 


BL00359B 23.07 7.462e~24 160- 
201 BL00359C 22.18 6.586e-22 
215-249 BL00359A 20.66 
4.000e-21 124-160 


1082 


BL01108 


Ribosomal protein L24 proteins. 


BL01108A 20.33 1.000e-08 144- 
177 


1084 


PD02462 


PROTEIN BOLA TRANSCRIPTION 
REGULATION AC. 


PD02462A 22.48 l,220e-09 104- 
139 


1084 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 7.000e- 17 755- 
772 BL00028 16.07 6.625e-15 
699-716 BL00028 16.07 8.412e- 
14 223-240 BL00028 16.07 
8,941e-14 167-184 BL00028 
16.07 6.850e-l 3 391-408 
BL00028 16.07 1.783e- 12 559- 
576 BL00028 16.07 2.957e-12 
307-324 BL00028 16.07 7.652e- 
12 503-520 BL00028 16,07 
7.652e-12 811-828 BL00028 
16.07 8.043e-1 2 335-352 
BL00028 16.07 L346e-ll 447- 
464 BL00028 16.07 2.385e-l 1 
867-884 BL00028 16.07 4.462e- 
11 671-688 BL00028 16,07 
5.846e-ll 587*604 BL00028 
16 07 6 199p-11 fttQufKfi 

BL00028 16.07 6.192e-ll 895- 
912 BL00028 16.07 8.962e-ll 
531-548 BL00028 16.07 L600e- 
10 279-296 BL00028 16.07 
1.600e-l 0 363-380 BL00028 
16.07 6.100e-t0 11M28 
BL00028 16.07 6.700e=10 643- 
660 BL00028 16.07 8,500e-10 
251-268 BL00028 16.07 L771e- 
09 783-800 BL00028 16.07 
5M6c-09 475-492 


1084 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A I0.52 4.600e-18 696- 
710 PRO0048A 10.52 5.09Ie-15 
164-178 PR00048A 10.52 6.727e- 
15 836-850 PR00048A 10.52 



257 



WO 02/070539 



PCT/US02/O5O95 



SEQ ID 

NO: 



Database 
entry ID 



1084 



Description 



PD00066 



PROTEIN ZINC-FINGER METAL-BINDL 



Results* 



1 .000e-l 4 220-234 PR00048A 
10.52 2.500e-14 444-458 
PR00048A 10.52 5.500e-14 556- 
570 PR00048A 10.52 9.250e~ 14 
388-402 PR00048A 10.52 1.643e- 
13 640-654 PR00048A 10.52 
3.57 le-1 3 304-318 PR00048A 
10.52 3.57ie-13 528-542 
PR00048A 10.52 8.071 e^l 3 332- 
346 PR00048A 10.52 8.07 le-1 3 
808-822 PR00048A 10.52 8.071e- 
13 864-878 PR00048A 10.52 
8.714e-13 500-514 PR00048A 
10.52 7.353e>-12 892-906 
PR00048B 6,02 ].000e-l 1 292- 
302 PR00048B6.02 1.000e-ll 
516-526 PR00048B 6.02 l.OOOe- 
II 824-834 PR00048A 10.52 
3.842e-l 1 276-290 PR00048A 
10.52 6.684e-ll 584-598 
PR00048A 10.52 9.053e-U 668- 
682 PR00048A 10.52 4.l30e-10 
360-374 PR00048A 10.52 6.870e- 
10 752-766 PR00048B6.02 
1.474e~09 768-778 PR00048B 
6.02 3.368e-09 236-246 
PR00048B 6.02 3,368e-09 460- 
470 PR00048B 6.02 4.789e-09 
152-162 PR00048B 6.02 4.789e- 
09 376-386 PR00048B 6.02 
4.789o-09 600-610 PR00048A 
10.52 4.960e-09 108-122 
PR00048A 10.52 4.960e-09 248- 
262 PR00048B 6.02 6.21 le-09 
264-274 PR00048B 6.02 6,21 le- 
09 488-498 PR00048B 6.02 
6.2 i le-09 796-806 PR00048B 
6.02 7.632e«09 404-414 
PR00048B 6.02 7.632e-09 712- 
722 PR00048A 10.52 8.920e-09 
780-794 



PD00066 13.92 1.600e-14 295- 
308 PD00066 13.92 L600e-14 
323-336 PD00066 13.92 L600e- 
14 519-532 PD00066 13-92 
1.600e-l4 547-560 PD00066 
13.92 1.600e^l4 827-B40 
PD00066 13.92 l.600e-I4 855- 
868 PD00066 13.92 5.200e-14 
351-364 PD00066 13.92 5,200e- 
14 575-588 PD00066 13.92 
8.200e~14 883-896 PD00066 
13.92 9-400e-I4 239-252 
PD00066 13.92 2.500e-13 155- 
168 PD00066 13,92 2.500e- 13 
379-392 PD00066 1 3.92 2.286c- 
12 267-280 PD00066 13.92 
2.286e-12 491*504 PD00066 
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SEQ ID 
NO: 


Database 
entry ID 


description 










13.92 2.286e-I2 799-812 * 
PD00066 13.92 2714e-12 463- fc 
476 PD00066 13.92 2.714e-12 | 
715-728 PD00066 13.92 2.714&- 1 

12 771-784 PD00066 13.92 g 
3.571c- 12 687-700 PD00066 & 

13 92 7 000e-I2 407-490 
PD00066 13 92 1 OOOe-lft 197- 
140 PD00066 13.92 1.000e-08 
603-616 


1085 


PR00679 


PROHIBITEN SIGNATURE 


PR00679F 8.03 6.478e-28 178- 
202 PR00679C 14 44 7 677e-22 
107-126 PR00679E 12.82 5 171e- 
19 153-173 PR00679D 11,91 
9,0S3e-l 8 130-147 PR00679G 
6.13 7.882e-17 201-218 
PR00679B 13.63 2.444e-10 84- 
104 


1086 


PR00245 


OLFACTORY RECEPTOR SIGNATURE 


PR00245E 12.40 8.286e-12 45-60 


1086 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY 
SIGNATURE 


PR00237G 19.63 5.814e-09 26-53 


1087 


PD00126 


PROTEIN REPEAT DOMAIN TPR 
NUCLEA, 


PD00126A 22.53 6.885e-I0 99- 
120 


1088 


BL01145 


Ribosomal protein L34e proteins. 


BL0U45A 13,73 1.000e-12 3-45 


1093 


BL00154 


E1-E2 ATPases phosphorylation site proteins. 


BL00154D 12.57 7.387e-09 95- 
106 


1093 


PR00121 


SODIUM/POTASSIUM-TRANSPORTING 

ATPASF SlfrNATt FRF 

/ V JL JL JTk.IJ.Lj LT 1 V 1 ^ i^V 1 \J IV JL,* 


PR00121E 13,97 9.444e-09 92- 


1095 


PD00066 


PROTEIN ZINC-FINGER METAI^BINDI, 


PD00066 1 3.92 7.923e-l 5 439- 
452 PD00066 13,92 2.800e-t4 
41 1-424 PD00066 13.92 2.800e- 
14 467-480 PD00066 13.92 
5.800c-14 495-508 PD00066 
13.92 5.800e-14 523-536 
PD00066 13.92 8.200e-14 355- 
368 PD00066 13.92 5.500e-13 
579-592 PD0006613.92 3.I43e- 
12 551-564 PD00066 13.92 
4.857e-12 383-396 


1095 


BL00028 


Zinc fittper C2H2 tvne domain nrnteirw 


BL00028 16 07 4 150e-13 367- 
384 BLO0028 16.07 8.200e- 13 
563-580 BL00028 16.07 3.348e- 
12 479-496 BLO0O28 16.07 
7.652e=I 2 423-440 BL00028 
16.07 8.826e-12 619-636 
BL00028 16.07 4.1 15e-ll 451- 
468 BL00028 16.07 5.500e-l 1 
395-412 BL00028 16,07 7.23 1 e- 
11 591-608 BL00028 16.07 
1.600e-10 339-356 BL00028 
16.07 2.200e-l 0 535-552 


1095 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 4.375e-16 560- 
574 PR00048A 10.52 4.21 4e- 13 
476-490 PR00048A 10.52 6.143e- 
13 364-378 PR00048B 6.02 
6.400e-l 3 492-502 PR00048B 
6.02 1.000e-ll 352-362 
PR00048B 6.02 l,000e-ll 408- 
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SEQ ID 
NO: 


Database 
entry ID 


Description 


Results* 








418 PR00048B &02 L692e-1 1 
548-558 PR00048A 10.52 1.947e~ 
11 588-602 PR0(»48A 10.52 
3.842e-ll 420-4311 PR00048B 
6.02 4.462e-U 4&1-474 
PR00048A 10.52 &684e-ll 392- 
406 PRO0O48A 10.52 6.684e-l 1 
448-462 PR00048B 6,02 7.23 le= 
11 436-446 PR00048A 10.52 
5.696e-10 532-546 PR00048B 
6.02 5.263e-09 576-586 
PR00048A 10.52 6.400e-09 504* 
518 PR00048A 10.52 6760e-09 
336-350 PR00048A 10.52 7.120e- 
09 616-630 


1095 


PD02462 


PROTEIN BOLA TRANSCRIPTION 
REGULATION AC. 


PD02462A 22.48 9-232e-09 472- 
507 


1097 


BL00649 


G-protein coupled receptors family 2 proteins. 


BL00649C 17.82 9.542C-12 400- 
426 


1097 


PR00249 


SECRBTIN-LIKE GPCR SUPERFAMILY 
SIGNATURE 


PR00249C 17.08 8*839e-ll 402- 
426 PR00249A 15,88 7.851e-09 
330-355 


1097 


PR00248 


METABOTROPIC GLUTAMATE GPCR 
SIGNATURE 


PR00248E 17.85 9.366e-09 442- 
465 


1100 


BL00479 


Phorbol esters / diacylglycerol binding 
domain proteins. 


BL00479B 12.57 8.875e-09 886- 
902 


1101 


PR00205 


CADHERIN SIGNATURE 


PR00205B 1 1.39 5.655e-l6 255- 
273 PR00205A 14.73 1.000e-12 
180-196 PR00205B 11 19 4 977p- 
10 475-493 PR00205C 13.65 
9.438e-10 515-530 


1101 


BL00232 


Cadherins extracellular repeat proteins 
domain proteins. 


BL00232B 32.79 1.000e-40 148- 
196 BL00232A27.72 5.125e-25 
54-87 BL00232B 32.794.286e-l9 
257-305 BL00232C 10.65 7.429e- 
16 255-273 BL00232B 32.79 
1.500e-10 372-420 BL00232C 
10.65 6.538e-l 0 475-493 
BL00232C 10.65 7.632e-09 146- 
164 


1103 


BL00122 


Carboxylesterases type-B serine proteins. 


BL00122A 12.04 3 J52e-15 86- 
107 BL00122D 12.53 7.097e-I4 
197-213 BL00122B 16.84 1.346e~ 
13 148-159 BL00122C7.9I 
9.550e-10 168-179 


1105 


BL000I8 


EF-hand calcium-binding domain proteins. 


BL00018 7.41 2.800e-10 217-230 
BL00018 7.41 8.650e-10 133-146 


1105 


PR00450 


RECOVERIN FAMILY SIGNATURE 


PR00450E 12.14 3,438e-15 174- 
193 PR00450B 1 1.76 5.574e03 
82-102 PR00450D 16.58 6,71 4e- 
13 152-172 PR00450C 12.22 
6.864M2 128-150 PR<ft)450G 
15.33 6.591e^09 22«45 


1108 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDL 


PD00066 13.92 6.l43e-12 927- 
940 PD00066 13.92 7.000e-09 
343-356 


1108 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 3.250e-13 91 1~ 
928 BL0002S 16.07 9.100e-l3 
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Database 
entry ID 


Description 


Results* 






t 

t 

D 


327-344 BL00028 16.07 3.348e- 
12 939-956 BL00028 16.07 
5.500^11270-287 BL00028 
16.074.0006-10298-315 
BL00028 16.07 6.700e-10 968- 
985 BL00028 16.07 9.700e- 10 
191-208 BL00028 16.07 9.700e- 
10 355-372 BL00028 16.07 
4 857e-09 384-401 BL00028 
16.07 7. 171e-09 242-259 


1108 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048B 6.02 6.000e~12 924- 
934 PR00048A 10.52 4.522e^l0 
324-338 PR00048A 10.52 4.91 3e- 
10 908-922 PR00048A 10,52 
7.652c-l 0 936-950 PR00048A 
10.52 4.600c-09 965-979 
PR0004RA 10 52 7 480e-OO 159- 
366 


1108 


DM01970 


0 kw ZK632.12 YDR313C ENDOSOMAL 
IIL 


DM01970A 8.50 9.1006-09 910- 
917 


1109 


DM01857 


5 kw NUCLEOSIDE TRANSPORT 
DEPENDENT NA. 


DM01857B 14 94 6 471 e- 19 784- 
312 DM01857E 18.90 7.31 3e- 18 
488-527 DM01 857F 12.86 
7.045e-1 5 548-575 DM01857C 
15,62 4.500e-14 312-344 
DM01857A 20.25 l,667e-13 207- 
250 DM018S7D 16,80 3.165e-l2 
372-410 


1112 


DM01S40 


kw SPAG24B 1 1.09 R07E5. 13. 


DM01 840B 22.04 1.844e-40 59- 
103 DM01840A 10.95 9.57 1 e- 13 
31-43 


1114 


BL01115 


GTP-Mnding nuclear protein ran proteins. 


BL01115A 10,22 3.4386-1453-97 


1U4 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 9.308e-16 53-75 
PR00449C 17.27 8.920e-15 94- 
117 PR00449B 14.34 5,680e- 10 
76-93 


1114 


PR00879 


FISH ACETYLCHOLINESTERASE 
SIGNATURE 


PR00879A6.28 1.450e-09 37^43 


1114 


BL0U25 


ROK family proteins. 


BL01125D 13.61 7.429e-09 214- 
227 


1115 


PF00622 


Domain in SPla and the RYanodine Receptor. 


PF00622B 21.00 2.500e-13 265- 
287 


1115 


BL00518 


Zinc Finger, C3HC4 type (RING finger), 
proteins. 


BL00518 12.23 9.571 e-10 103- 
112 


1116 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019A 11.19 5.050e-ll 66-80 
PR00019B 11.36 6.850e-10 63-77 


1116 


DM00315 


072 RIBONUCLEASE INHIBITOR. 


DM00315G 15.85 6.362e-10 84- 
120 DM00315G 15.85 3.340e-09 
246-282 


1117 


BL00509 


Ras GTPase-activating proteins. 


BL00509B 10.28 5.263e-10 429- 
440 


1117 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 9.357e-10 794- 
809 PR00049D 0.00 1 .915^09 
793-808 PRO0O49D 0.00 3.593e- 
09 792-807 PR00049D 0.00 
5J29e-09 791-806 


1117 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 7.559e-09 780- 
813 
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1117 


PR00806 * 


ViNCULIN SIGNATURE 


PR00806A 6.63 8.397e-09 794- 
805 


U20 


PR00720 | 

! 


MAMMALIAN LMW 
PHOSPHOTYROS1KE PROTEIN 
PHOSPHATASE SIGNATURE 


PR00720C 12.41 L099e-27 88- 
109 PR00720B 10,61 4.789e-20 
71-87 PR00720A 16.54 2.000e-17 
28-41 PR00720E 10.01 1.342e-16 
117-139 PR00720D 17,32 1.778e- 
15 110-127 


1120 


PR00719 


LMW PHOSPHOTYROSINE PROTEIN 
PHOSPHATASE SIGNATURE 


PR00719A 14.49 3.000e-23 9-27 
PR00719C 14.10 5.000e-l8 85- 
101 PR00719B 14.32 L346e-15 
52-69 PR00719D 17.52 9,654e-15 
110-124 


1121 


PD00131 


ATP-BINDING TRANSPORT 
TRANSMEMBR. 


PD00131B 34.97 7.987e-09 108- 
162 


1123 


BL00615 


C-lype lectin domain proteins. 


BL006I5A 16.68 9.526e-13 573- 
591 


1123 


PR00356 


TYPE 11 ANTIFREEZE PROTEIN 
SIGNATURE 


PR00356C 1333 3.793e-10 591- 
609 PR00356D 13.09 5.038e-09 
619-636 


1123 


PR00439 


1 1-S SEED STORAGE PROTEIN FAMILY 
SIGNATURE 


PR00439C 15.32 9,217e-09 332- 
353 


1127 


BL00134 


Serine proteases, trypsin family, histidine 
proteins, 


BL00134A 1 1.96 5.781e-15 493- 
510 BL00134B 15.99 4.1 94e- 14 
675-699 


1127 


BL0O02I 


Kringle domain proteins. 


BL00021B 13.33 8.9S4e-12 493- 
511 


1127 


BL00495 


Apple domain proteins. 

i 


BL00495N 11.04 9.735e- 11 667- 
702 


1127 


PR00453 


VON WILLEBRAND FACTOR TYPE A 
DOMAIN SIGNATURE 


PR00453A 12.79 5.345e-I4 251- 
269 PR00453B 14.65 4.682e- 10 
290-305 


1127 


PR00722 


CHYMOTR YP SIN SERINE PROTEASE 
FAMILY (SI) SIGNATURE 


PR00722A 12.27 9J29e-l3 494- 
510 PR00722C 10,87 4.2736-11 
674-687 PR00722B 12,51 4.000e- 
09 554-569 


1127 


BL01253 


Type I fibronectin domain proteins. 


BL01253G 11.34 5.348e-09 674- 
6S8 


1128 


BL00236 


Neurotransmitter- gated ion-channels proteins. 


BL00236D 25.66 4.000e-30 64- 
106 


1128 


PR002S2 


NEUROTRANSMtTTER-GATED ION 
CHANNEL FAMILY SIGNATURE 


PR00252D 12.29 7.097e-!0 71-84 


1129 


BL00604 


Synaptophysin / synaptoporin proteins. 


BL00604F 5,96 7.718e-10 367= 
412 


1129 


PR00524 


CHOLEC YSTOKININ TYPE A RECEPTOR 
SIGNATURE 


PR00524F 5.36 7.415e-09 208- 
222 


1129 


BL01113 


CI q domain proteins. 

* 


BL01 1 13A 17.99 6.455e-t4 158- 
185 BL01113A 17.99 5.622e- 13 
28-55 BL01U3A 17.99 L923e- 
12 393-420 BL0U13A 17.99 
4.462e-l 2 230-257 BL01113A 
17.99 6.53 8e- 1 2 34-61 
BL01313A 17,99 I.000e-ll 284- 
311 BL01113A 17.99L205e-ll 
31-58 BL01113A 17.99 1.614c- 
11 170-197 BL01113A 17.99 
5.09Ie-ll 167-194 BL011I3A 
17.99 6,523e-l 1 456-483 
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BL01H3A 17.99 7.341&-11 411- 
438 BL01113A 17.99 1.766e-10 
402-429 BL01113A 17.99 
2.915e-10 356-383 BL01113A 
17.99 6. 745e-I0 239-266 
BL01113A 17.99 7.319e40 293- 
320 BL01113A 17.99 8.085e-10 
164-191 BL01113A 17.99 
8.660e-10 305-332 BL01113A 
17.99 2.038e-09 272-299 
BL01 1 13A 17.99 2.212e-09 353- 
380 BL01113A 17.99 2.385e-09 
86-113 BL01 11 3A 17.99 2.731 e- 
09 149-176 BL01113A 17.99 
2.904e-09 453480 BL01113A 
17.99 3.423e-0913M58 
BL01 1 13A 17.99 3,423e-09 308- 
335 BL01113A 17.99 3.769e*09 
396-423 BL01113A 17.99 
3J69e-09 450-477 BL01113A 
17.99 3.9426-09 25-52 
BL01 1 13A 17.99 3.942e-09 173- 
200 BL01113A 1 7,99 4. 1 1 5e-09 
278-305 BL01113A 17,99 
4.98 le-09 43-70 BL01I13A 
17.99 5.327e-09 19-46 
BL01 1 13A 17.99 5.500e-09 320- 
347 BL0IU3A 17.99 6.192e-09 
143-170 BL01113A 17.99 
7.23 le-09 224-251 BL01113A 
17.99 8.269e-09 245-272 



1129 



BL00420 



Speract receptor repeat proteins domain 
proteins. 



BL00420A 20.42 3.571e-13 31-60 
BL00420A 20.42 9.082e-13 113- 
142 BL00420A 20.42 8.691e-ll 
311-340 BL00420A 20,42 
4.098e-10 125-154 BL00420A 
20.42 4.541e-10 158487 
BL00420A 20.42 5.279e-10 34-63 
BL00420A 20.42 5.426e-10 137- 
166 BL00420A 20.42 6.754e-10 
49-78 BL00420A 20.42 6.902e- 
10 266-295 BL00420A 20.42 
7.492e-l0 43-72 BL00420A 
20,42 8.082*10 25-54 
BL00420A 20.42 9.852e-10 167- 
196 BL0Q420A 20.42 2.800&-09 
170-199 BL0042OA -20.42 
2.938e-09 41.4443 BL00420A 
20.42 3.4926-09 52-81 
BL00420A 20.42 5,015e-09 305- 
334 BL00420A 20.42 5.569e-09 
37-66 BL00420A 20.42 7.923e- 
09 459488 BL00420A 20.42 
8477e-09 335-364 BL00420A 
20.42 8.754e-09 83-112 
BL00420A 20.42 9.446e-09 287- 
3 1 6 BL00420A 20.42 9.862e-09 
290-319 
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1132 


PR00042 


FOS TRANSFORMING PROTEIN 
SIGNATURE 


PR00042E 9.69 7.652&09 234- 
258 g 


1135 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 9.400e|i4 365- 
378 PD00066 13.92 6.Rt3e-12 
335-348 PD00066 I3.&2J74e- 
1 1 395-408 *> 


1135 


BL00970 


Nuclear transition protein 2 proteins. 


BL00970B 10.09 3.069e-10 55-81 


1135 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 3.746e-09 223- 
238 PR00049D 0.00 3.746e-09 
224-239 PR00049D0.0O3.898e- 
09 220-235 


1135 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 6.885e^l I 349- 
366 BLOO028 16.07 5.886e-09 
379-396 


1135 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048B 6.02 6.21 te-09 362- 
372 


1135 


DM00215 


PROLINE-R1CH PROTEIN 3. 


DM00215 19.43 8.169e-09 221- 
254 


1136 


BL00962 


Ribosomal protein S2 proteins. 


BL00962D 22.51 5.500e-35 131- 
175 BL00962C 15.90 9.591e-t7 
106-124 BL00962B 36.15 9.060e- 
15 40-94 


1136 


PR00395 


RIBOSOMAL PROTEIN S2 SIGNATURE 


PR00395C 16.17 l,000e-17 106- 
124 PR00395D 13.04 7.000e-17 
131-149 PR00395F 10,56 6.400e- 
16 169-184 PR00395E 14.46 
4.103e-ll 148-160 


1137 


BL00152 


ATP synthase alpha and beta sub units 
proteins- 


BL00152A 15.38 5.109e-14 128- 
154 


1139 


BL00152 


ATP synthase alpha and beta subunits 
proteins. 


BL00152B 21.40 4.273e~37 124- 
162 BL00152A 15.38 8.364e-23 
67-93 


1139 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 Z862e~09 139- 
161 


1140 


BL00152 


ATP synthase alpha and beta subunits 
proteins. 


BL00152B 21.40 2.000e-32 185- 
223 BL00152A 15.38 8.364e-23 
128-154 


U40 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 7.672e-09 200- 
222 


1141 


PR00493 


BREAST CANCER TYPE I 
SUSCEPTIBILITY PROTEIN SIGNATURE 


PR00493G 7.57 l.l84e-10 652- 
673 


1141 


FDQ0Q78 


REPEAT PROTEIN ANK NUCLEAR 
ANK.YR. 


PD00078B 13.14 3.700e-10 494- 
507 


1141 


BL00518 


Zinc finger, C3IIC4 type (RING finger), 
proteins. 


BL00518 12.23 3.667e-09 102- 

in 


1141 


PF00791 


Domain present in ZO-1 and Unc5-like netrin 
receptors. 


PF00791B 28.49 9.047e-15 501- 
556 PF0079IB 28.49 9JS6e~ 14 
468-523 PF00791C 20.98 9.814e- 
10 515-554 PF00791C 20.98 
7.61 8e-09 482-521 


1141 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 3.500e-12 534- 
550 PF00023A 16.03 7.857e-ll 
501-517 PF00023B 14.20 9.591e- 
09 497-507 


1143 


BL00301 


GTP-binding elongation factors proteins. 


BL00301A 12.41 L750e-12 72-84 


1143 


PR00315 


GTP-B1NDING ELONGATION FACTOR 
SIGNATURE 


PR00315A 11.81 4.000e-14 72-86 
PR003I5B 11.66 7.600e-IO llS^ 
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: 


127 


1145 


BL00745 


Prokaryotic-type class I peptide chain release 
factors signat & 


BL00745C 13.66 7.398c-18 59- 
106 


1146 


BL00745 


Prokarvotic-tvoe class 1 Dentide chain relpnlfJ 
factors signat. § 


RT 0074*?^ HfiAd 7flrtp-17 ^0- 

106 


1149 


BL00660 


Rfind 4 1 Fnmilv rlnmAtn nvAfpIriQ ® 

AfJCHlU I 1(111111 J UlHllalll L71 VlAjlllvJt 


RT ftfl££nTl 17 11 A CAAa 17 11/: 

176 BL00660A31.50 7,911e-20 
RT 0fi6fiftr 71 J&lt* 

19215-259 BL00660E 23.41 
9.647e43 301-324 


U49 


PR00935 


BAND 4 1 PROTTTN FAMFI Y 
SIGNATURE 


PlJAftOQ^P 1 1 OJZ A 1f)f\t> 11 I^A 

175 PRO0935D 10.20 1.281e-I4 
12 14NI55 PR00935A 10.16 


1149 


PR00661 


ERM FAMILY SIGNATURE 


PR00661C 9.53 3.616e-10 150- 
172 


1153 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 1.882M2 155- 
174 


1153 


BL00107 


Protein kinases ATP-binding region proteins* 


BL00107A 18.39 1.818e-15 155- 

1Rfi RT nftlftTR 11 11 R 7I4a-1 1 
221-237 


1153 


BL00239 


Receptor tyrosine kinase class II proteins. 


BL00239B 25. 1 5 1 .774e-09 90- 

11R 

1 JO 


1153 


BL00240 


Receptor tyrosine kinase class III proteins. 


BL00240E 11.56 6.657e-09 141- 

170 


1153 


BL00479 


Phorbol eiters / diaevlifrlveernl hindincr 
domain proteins. 


213 


1155 " 


PR00837 


ALLERGEN V5/TPX- 1 FAMILY 
SIGNATURE 


PRO0837C 17.21 4 f 064e-ll 155- 
172 PR 00817 A 1 & 77 4 Qfift>- 1 ft 
78-97 PR00837B 11.64 L310e-09 
133-147 


1155 


BL01009 


Extracellular proteins SCP/Tpx-l/Ag5/PR~ 
1/Sc7 proteins. 


BL01009D 14.19 8.759e-12 156- 
177 BL01009C 10.54 l.730e-09 
133-147 


1155 


PR00199 


ANNEXIN TYPE III SIGNATURE 


PR00199F I6.I99.483C-09 113- 
140 


1156 


BL0IU5 


GTP-binding nuclear protein ran proteins. 


BL0 1 II 5A 1 0.22 8.909e- 1 3 4-48 


1156 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 2.059e-19 4-26 
PR00449C 17.27 1.000e-18 44-67 
PR00449B 14,34 6.727e-ll 27-44 


1159 


BL00175 


Phosphoglycerate mulase family 
phosphohistidinc proteins. 


BL00I75D 27.67 4.000e-40 367- 
419 BL0OI75C 23.75 6.870e-28 
316-348 BL00175A 15.42 
8.200e-l 9 252-272 BL00175B 
12.60 8.714e-l 7 299-312 


1159 


BL00300 


SRP54-type proteins GTP-binding domain 
proteins. 


BL00300B 20.56 7.554e-ll 38-84 


1160 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 1.000e-14 547^ 
560 PD00066 13.92 2.200e-l4 
153-366 PD00066 13.92 3.400e- 
14241-254 PD00066 13.92 
6.400e-14 325-338 PD00066 
13.92 1.500e-l 3 297-310 
PD00066 13.92 6.500e-13 465- 
478 PD00066 13,92 7.500e-13 
437-450 PD00066 13,92 8.5O0e- 
13 409-422 PD00066 13.92 
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_L. - 


2,71 4e-l 2 269-282 PD00Q66 
13.92 3.571e~l2 381-394 
PD00066 13.92 7.577e-10 519- 
532 


1160 


BL00028 


D 


BL00028 16.07 3.647e- 14 531- 
548 BL00028 16,07 9.471c~14 
309-326 BL00028 16.07 L900e- 
13 559-576 BL00028 16.07 
7 750e-H 477-494 Rt OftfY?R 
16*07 2. 1 74e~I2 337-354 
BL00028 16.07 6.478e-12 225- 
242 BL00028 16.07 8.043e-i2 
421-438 BL00028 16.07 9.21 7e- 
12 365-382 BL00028 16.07 
2.038e-ll 253-270 BL00028 
16.07 7.231e-ll 281-298 
BL00028 16.07 6.100e-10 449- 
466 BL00028 16.07 8.800e~10 
503-520 BL00028 16.07 5.371e- 
09393-410 


1160 


PR00048 


C2B2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 7,429e-13 418- 
432 PR00048A 10.52 2,588e« 12 
362-376 PR00048A 10.52 3.647e~ 
12 556^570 PR00048A 10.52 
8.41 2e-12 474-488 PR00048A 
10.52 9.47 le-l 2 222-236 
PR00048A 10.52 2.421e-U 306- 
320 PR00048B 6.02 4.462e-l 1 
350-360 PR00048A 10.52 5.737e- 
11 250-264 PR00048A 10.52 
7.632e-l 1 528-542 PR00048A 
10,52 8.579e-H 278-292 
PR00048A 10.52 9.053e-l I 446- 
460 PR00048B6.02 9.308e-ll 
544-554 PR00048B 6.02 9.308e- 
11 572-582 PR00048B6.02 
].000e-10406-416 PR00048A 
10.52 2.565e-10 334-348 
PR00048B 6.02 4.938e-10 378- 
388 PR00048A 10.52 5.304MO 
390-404 PR00048B 6.02 8.31 3e- 
10 238-248 PR00048B 6.02 
1.947e-09 434-444 PR00048B 
6.02 2.421e-09 462-472 
PR00048B 6.02 3.368e-09 322- 
332 PR00048B 6.02 3.842e-09 
266-276 PR00048B 6.02 6.21 le- 
09294-304 


1161 


PD02331 


CYCLIN CELL CYCLE DIVISION PROTE. 


PD02331C 13.84 1.913e~ll 9-36 


1161 


BL00048 


Protamine PI proteins. 


BL00048 6.39 3.700c-09 165-192 
BL00048 6.39 4.938e-09 28 1 -308 


1161 


DM01206 


CORONA VIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 9.328e-ll 169- 
189 DM01206B 10,69 1.247e-l0 
248-268 DM01 206B 10.69 
7.78Ie-I0 200-220 DM01206B 
10.69 6.582e-09 246-266 


1163 


BL00239 


Receptor tyrosine kinase class II proteins. 


BL00239B 25.15 3.91 5e- 15 100- 
148 
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1163 


J&00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 6.362e-13 165- 
184 


1163 


i00479 


Phorbol esters / diacylglycerol binding 
domain proteins. 


BL00479C 12.01 1.000e-lO 208- 
22J 


1163 


B600107 
0 


Protein kinases ATP-binding region proteins. 


BL00107A 18.39 3.250e^26 165- 
196 BL00107B 13.31 3.647*09 
230-246 


1164 


BL01013 


Oxysterol-binding protein family proteins. 


BL01013D 26.81 9J35e-22 501- 
545 BL01013A 25.14 4.600e~14 
220-256 BL010I3C 9.97 4.906e- 
12 330-340 BL0IO13B 11.33 
3.01 7e-U 287-298 


1167 


BL00289 


Pentaxin Family proteins. 


BL00289A 30.36 6.850e-26 25-56 
BL00289E 18,00 6,684e-14 78-93 


1167 


PR00895 


PENTAXIN SIGNATURE 


PR00895A 14.53 L563e-15 48-63 
PR00895G 14,55 5.846e-12 75-87 


1167 


PR00468 


PLANT LIPOXYGENASE SIGNATURE 


PR004681 13,42 9.870e-09 59-74 


1168 


PR00217 


43 KD POSTSYNAPTIC PROTEIN 
SIGNATURE 


PR00217C 10.91 7,527e-09 547- 
563 


1169 


PR00756 


MEMBRANE ALANYL DIPEPTIDASE 
(Ml) FAMILY SIGNATURE 


PR00756D 10.58 l,529e-21 367- 
383 PR00756B 14.06 5.737e- 16 
253-269 PR00756A 12.90 l.237e- 
13 205-221 PR00756E J 1.91 
4.094e-13 386-399 PR00756C 
11.60 6.108e-ll 331-342 


1169 


BL00142 


Neutral zinc metallopeptidases, zinc-binding 
region proteins. 


BL00142 8.38 5.500e-10 367-378 


1170 


DM01688 


2 POLY-IG RECEPTOR. 


DM01 6881 14.97 6,279e-09 75- 
123 


1172 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308C 3.83 2.523e-10 40-50 
PR00308C3.83 8.892e-l 041-51 
PR00308C 3.83 8.892e-10 42-52 
PR00308B 4.28 6.671e-09 40-52 


1172 " 


PR00833 


POLLEN ALLERGEN POA PI 
SIGNATURE 


PR00833H 2.30 7.1 15e~09 30-45 


1179 


BLOO300 


SRP54-type proteins OTP-binding domain 
proteins. 


BL00300C 25.57 6,000e-09 215- 
269 


1180 


BL00514 


Fibrinogen beta and gamma chains C- 
terminal domain proteins. 


BL00514C 17.41 9.463e-19 233- 
270 BL00514E 14.28 7J50e- 12 
293-310 BL00514D 15.35 
9.824e-l 1 274-287 BL00514G 
15.98 4.273e-l 0 356-386 
BL00514H 14.95 6.217e-09 391- 
416 


1181 


BUM 158 


Macrophage migration inhibitory factor 
family proteins. 


BL01158A 21.81 4.130e-30 2-47 
BL01158B 17.07 4.3 16e-29 47-74 


1182 


BL00456 


Sodium :so!ute symporter family proteins. 


BL00456A 22.59 6.250e-40 46- 
101 BL00456C 24,55 6.586e-40 
184-239 BL00456B 18.94 8.125e~ 
25 122-152 BL00456D6.92 
5.500e-1 0 476-486 


1185 


PR00830 


END OPEPTIDASE LA (LON) SERINE 
PROTEASE (S16) SIGNATURE 


PR00830A 8,41 4.780e^l4 241- 
261 


1185 


PR00918 


CALICIVIRUS NONSTRUCTURAL 
POLYPROTEIN FAMILY SIGNATURE 


PR00918A 13.76 1.976e-10 231- 
252 PR00918A 13.76 2,084e-10 
495-516 


1185 


PR00300 


ATP-DEPENDENT CLP PROTEASE ATP- 


PR00300A 9.56 5.857e-12 237- 
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BINDING SUBUN1T SIGNATURE 


256 PR00300A 9.56 5.909e-09 
501-520 


1185 


BL00370 


PEP-utilizing enzymes phosphorylation site 
proteins proteins. 


BL00370A 5.71 6.294e-09 79-87 


1185 


BL00113 


Adenylate kinase proteins. 


BL00I13A 12.74 7.231e-09 502- 
519 


1185 


BL00674 


AAA-protem family proteins. 


BL00674D 23.41 2.286e-30 324- 
371 BL00674B4.461.205e^l7 
234-256 BL00674C 22.60 2.0S9e^ 
17 531-574 BL00674B4.46 
4,886e- 1 7 498-^520 BL00674E 
15,24 2.800e-l 5 402-422 
BL00674C 22.60 7.600e-09 270- 
313 


1185 


BL00870 


Chaperonins clpA/B proteins. 


BL00870A 1 L78 9.534e-09 210- 
257 


1185 


BL00675 


Sigma-54 interaction domain proteins ATP- 
binding region A proteins. 


BL00675A 24.86 9.775e-09 237- 
281 


1186 


PR00165 


ANION EXCHANGER SIGNATURE 


PR001651 10.02 8.412e-14 829- 
849 PR00165A9.84 6.423e-!3 
495-518 PR00165B 15.26 9.090e~ 
11 520-541 PR00165F 10.39 
6.663e- 10 639-658 


1186 


BL00291 


Prion protein. 


BL00291 A 4.49 9.675e-10 436- 
471 


1186 


BL00219 


Anion exchangers family proteins. 


BL002I9B 14.47 2.707e-24 296- 
340 BL00219C 17.29 5.426e-23 
341-380 BL00219K 12,73 
9. 100e-23 831-873 BL00219M 
9.989.299e-23 916-962 
BL00219H 10.06 5J05e>21 618- 
666 BL00219I 6 16 4 968e~20 
741-795 BL00219A 17.13 
7.833e-I9 122-154 BL00219E 
11.63 2.988e-l 6 485-525 
BL00219F 10.52 8.953e-14 525- 
549 BL002 1 9G 12.86 8.163e-13 
578-617 BL00219L 18,71 8.423e- 
13 873-912 BL00219N 10.66 
6.942e-12 978-1022 BL00219D 
l5J5 5.286e-Il 380-416 
BL00219O 14.02 3.377e-09 1023- 
1063 


1186 


PD01168 


SYNTHETASE LIGASE PROTEIN 
ALANYL. 


PD01 168L 9,47 7,833e-09 452- 
467 


1186 


DM01724 


lew ALLERGEN POLLEN CIM1 HOL-LI. 


DM01724 8.14 4.296e40 446-466 
DM01 724 8. 1 4 6.447e-09 442-462 
DM01724 8.14 7.987e-09 438-458 


1187 


BL00027 


'Homeobox' domain proteins. 


BL00027 26.43 7.943e-14 65-108 


1187 


PR00025 


HOMEOTIC ANTENNAPEDIA PROTEIN 
SIGNATURE 


PR00O25B 1 1.944.000c- 11 51-67 


1187 


BL00032 


'Homeobox' antennapedia-type protein. 


BL00032B 10.83 7,660e-10 54-93 


1187 
1189 


PR00028 


POU DOMAIN SIGNATURE 


PR00O28D 17.92 2.286e-09 49-70 


BL00115 


Eukaryotic RNA polymerase II heptapeptidc 
repeat proteins. 


BL001 15Z 3.12 2,047e40 469- 
518 


1192 


BL01215 


Mrp family proteins. 


BL01215A 9.75 2>436e=09 466- 
493 


1192 


BL00567 


Phosphoribulokinase proteins. 


BL00567A 10.66 5.757e-09 470- 
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489 ? 


1192 


PR00364 


DISEASE RESISTANCE PROTEIN 
SIGNATURE 


PR00364^ 
486 


^8.19 7.341^09 470- 


1192 


BL0O113 


Adenylate kinase proteins, 


BL00113J 
489 


jl2.74 8.062e-09 472- 


1193 


PR003O3 


PREPROTEIN TRANSLOCASE SECY 
SUBUNIT SIGNATURE 


PR00303CW0.45 8.759e-09 88- 
111 


1197 


PF00429 


ENV polyprotein (coat polyprotein). 


PF00429 31 .08 8.01 5e- 1 6 41 5-465 


1198 


BL00415 


Synapsins proteins. 


BL004I5N 4.29 7.115e-l0 224- 
268 


1198 


PR00239 


MOLLUSCAN RHODOPSIN C- 
TERMINAL TAIL SIGNATURE 


PR00239E L58 1 .307e-09 253- 
265 


1198 


PR00211 


GLUTELIN SIGNATURE 


PR002UB 0.86 7.537e-l2 245- 
266 PR0021 IB 0.86 2.644e-10 
251-272 PR00211B0.86 4.083e- 
09 233-254 PR00211B 0.86 
7.583e-09 239-260 


1198 


DM00215 


PROLINE-RICW PROTEIN 3. 


DM00215 19,43 5.688e-12 227- 
260 DM00215 19,43 6*250e-12 
225-258 DM00215 19,43 5.235e- 
11 232-265 DM0021519.43 
5.941e-U 242-275 DM00215 
19.43 4.375e-l 0 236-269 
DM00215 19,43 4,857e-10 222- 
255 DM00215 19.43 5.179e~10 

230- 263 DM00215 19,43 8.554e- 
10 237-270 DM00215 19.43 
2.068e-O9 2 1 5-248 DM002 1 5 
19.43 3.898e-09 235-268 
DM00215 19.43 4.508e-09 240- 

Ot) 1 — v1\ Anns* ic rn c r** £ - i\t\ 

115 DMU0215 19.43 5.576C-09 

231- 264 DM00215 19.43 6.339e- 
09 220-253 DM00215 19.43 
9.847e-09 218-251 


1200 


BL00183 


Ubiquitln-conjugating enzymes proteins. 


BL00183 28.97 4.326e-22 81-129 


1202 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 4.214e-16 108- 
148 J5LUU23/C I j.ly j.Jzie-1 1 
245-272 BL00237B5.282.227e- 
09 182^194 


1202 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY 
SIGNATURE 


Pn00217F n 57 4 9Sft_ 
275 PR00237E 13.03 1.000e-12 
174-198 PR00237G 19.63 7.469e- 
12 288-315 PR00237B 13.50 
7.207e~l I 75-97 PR00237C 15.69 
7.750e^09 122-145 


1203 


BL00237 


G=protein coupled receptors proteins. 


BL00237A 27.68 4.214e-16 108- 
148 BL00237C 13.19 3J23e-ll 
280-307 BL00237B 5.28 2.227e- 
09217-229 


1203 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY 
SIGNATURE 


PR00237F 13.57 4.600e-13 285- 
310 PR00237E 13.03 1.000e-12 
209-233 PR00237G 19.63 7.469e- 
12323-350 PR00237B 13.50 
7.207M1 75-97 PR00237C 15.69 
7.750e-09 122-145 


1207 


PR00259 


TRANSMEMBRANE FOUR FAMILY 
SIGNATURE 


PR00259B 14.81 3.769c-21 50-77 
PR00259C 16.40 4.000e-20 77- 
106 PR00259A9.27 3.600e-16 
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12*36 


1207 


BL00421 


Transmembrane 4 family proteins. 


BL00421B 17,62 X261e-36 56-95 
BL00421A 11.79 8.313e-16 8-27 


1207 


PR00164 


ABC-2 TYPE TRANSPORT SY #EM 
MEMBRANE PROTEIN SIGNATURE 


PR00164D 13.90 1.486e-09 9-34 


1208 


BL00282 


Kazal serine protease inhibitors fafflily 
proteins. 


BL00282 16.88 7.207e-14 562- 
585 


1208 


BL002I6 


Sugar transport proteins. 


BL002I6B 27.64 3.250e-10 267- 
317 


1209 


PF00922 


Vesiculovirus phosphoprotein. 


PF00922A 19,17 7.724e-09 88- 
122 


1214 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 7J43e-10 17-32 


1214 


PD01351 


PROTEIN REPEAT NEUROFILAMENT 
TRIPL. 


PD01351R 13 7"> 9 

PD01351B 13.72 3 J58e-09 24-50 


1214 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 5.576e-09 5-38 


1214 


PR00910 


LUTEOV1RUS ORF6 PROTEIN 
SIGNATURE 


PR009IOA2.51 7.857e-09 17-30 


1215 


BL00612 


Osteonectin domain proteins. 


BL00612E 13.12 3,947e~ll 379- 
424 


1215 


BL00484 


Thyroglobulin type-1 repeat proteins proteins. 


BL00484C 17.01 3.854e-H 131- 
146 BL00484B 9.04 1.491e-10 
238-252 BL00484C 17.01 8.560e- 
10 258-273 BL00484B9.04 
3,850e-09 111-125 


1216 


BL00223 


Annexins repeat proteins domain proteins. 


BL00223A 15.59 LOOOe-33 147- 

75-109 BL00223C24.79 3.928e- 
15 134-189 


1216 


PR00I96 


ANNEX FN FAMILY SIGNATURE 


PR00I96C 10.36 3.571e-25 151- 
173 PR00196A 11.16 7.300e-24 
84-107 PR00196B 10.68 4.808e- 

16 194-141 PROftlQtfA 11 Ifi 
6,236e-14 156-179 PR00196E 
9.191.000e-12 155-176 
PRO0196G 1 1.72 5.829e-l I 199- 
2J3 PR00196C 10.36 7.913e-ll 
79-101 


1216 


PR00201 


ANNEXIN TYPE V SIGNATURE 


PR00201D 10.49 8.729e-l4 151- 
173 PR00201A 6.05 1.976e-13 
84-107 PR00201Gll.02 3.847e- 
12 155-182 PR0020IA6.05 
S.241e-12 156-179 PR00201H 
12.04 4.889e-10 199-213 


1216 


PR00198 


ANNEXIN TYPE II SIGNATURE 


PR00J98D 7.65 7.787e-21 151- 
173 PR00198B 8.71 3.880e-17 
84-107 PR00198C 14.32 2.688e- 
11 124-141 PR00198G8.09 
7.033e-l0 155-176 


1216 


PR00200 


ANNEXIN TYPE IV SIGNATURE 


PR00200E 10.001030e-19 151- 
173 PR00200G 9.43 5.546e-14 
155-182 PR00200B 7.39 4.653e- 
11 156-179 PR00200B739 
4.857e-10 84-107 PR00200H 
13.68 9.663e40 199-213 


1216 


PR00202 


ANNEXIN TYPE VI SIGNATURE 


PR00202D 5.58 6.793e-14 151- 
173 PR00202G 8.01 5.545e^l3 
1 55- 1 82 PR00202B 1 1 .44 2.782e* 
10 155-179 PR00202B 11.44 
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> 


5,206e-09 83-107 


1216 


PR00197 


ANNEXIN TYPE I SIGNATURE 
D 


PR00197B 7.56 4.960e-29 84-107 
PR00197D 7,50 3.000e-26 151- 
173 PR00197A 8 68 7 577e-20 
35-51 PR00197C 7.50 l.OOOe-19 
124-141 PR00197F 9*03 7- 1 28e- 
10 155-176 PR00197D7.50 
6.250e-09 79-101 


1216 


PR00199 


ANNEXIN TYPE III SIGNATURE 


PR001Q9D S 6S 0 1 SI - 

f 1 771/ J.LJJ 7ki 7 / L" 1 / I J J — 

173 PR00199B6,86 2.915e-13 
84-107 PR00199B6.86 1.265e-U 
156-179 PR00199G9.094,351e- 
11 156-182 PR00I99D5.65 
3,64 le-09 79-101 PR00199C 
13,84 9.57Ie-09 124-141 


1217 


BL00223 


Annex ins rftneat ni"frfein<! rlnmAin nmin'tn^t 


238 BL0O223A 15.59 1.000e-33 
119-153 BL00223A 15,59 
1.435&-16 47-81 BL00223C 24.79 
3.928e-15 106-161 


1217 


PR00201 


ANNEXIN TYPE V SIGNATURE 


PR00201D 10.49 8.729e-l4 123- 

56-79 PR00201G 1 1.02 3.847e-12 
127-154 PR0020IA6.05 8.24Ie- 
12 128-151 PR00201E 12.37 
3.3 1 7e-l 1 206-233 PR0020 1 H 
12.04 4.889e-l0 171-185 


1217 


PR00198 


ANNEXIN TYPE II SIGNATURE 


PR00198D7.65 7.787e-21 123- 
145 PR00198B 8.71 3.880e-17 
56-79 PR00198E 14,67 5.286e- 15 
206-233 PR00198C 14,32 2.688e~ 
11 96-113 PR00I98O8.09 


1217 


PR00196 


ANNEXIN FAMILY SIGNATURE 


PR00196D 21.86 LOOOe-27 206- 
233 PR00196C 10.36 3.57Ie-25 
123-145 PR00196A 11.16 7.300e- 
24 56-79 PR00196B 10.68 
4.808c-16 96-113 PR00196A 
11.16 6.236e-14 128-151 
PR00196E 9.19 1.000e-12 127- 
148 PR00196G 11.72 5.829c- 11 
171-185 PR00196C 10.36 7,913e- 
11 51-73 PR00196C 10.36 
8.750e- 10 282-304 


1217 


PR00202 


ANNEXIN TYPE VI SIGNATURE 


PR002O2D 5.58 6.793c-14 123- 
145 PR00202G 8.01 5.545e-13 
127-154 PR00202E13.00 8.740e- 
11 206-233 PR00202B 11.44 
2.782e-10 127-151 PR00202B 
il.44 5.206e-09 55-79 


1217 


PR00197 


ANNEXIN TYPE I SIGNATURE 

* 


PR00197E 11.89 1.794e-32 206- 
233 PR00i97B7.56 4.960e-29 
56^79 PR00I97DZ5O3.000e-26 
123-145 PR00197A 8.68 7.577e- 
20 7*23 PR00197C7.50 1.000s- 
19 96-113 PR00197F 9.03 
7.128e-10 127-148 PR00197D 
7.50 6.250e-09 51-73 
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1217 


PR00200 


ANNEXIN TYPE IV SIGNATURE 


PR00200E 10.00 5.030e-19 123- 
145 PR00200G9.43 5.546e-14 
127-154 PR00200B 7.39 4,653e- 
11 128-151 PROO200F 13.72 
7.094e-l I 206-233 PR00200B 

13.68 9.663e^l0 171-185 
PR00200E 10.00 8.8426-09 282- 
304 


1917 


FSWJyJ L?y 


AMTsfPYTlvT TVPP FIT <2inMATT IT> P 
/YlNlNllAIlN 1 I r a III oJAJlNAVl UlvJC 


rKuuiyyjj xco y.zy/e-iv 123- 
145 PR00199F 16.19 4.39le-15 
206-233 PR00199B 6.86 2.91 5e- 
13 56-79 PR00199B 6.86 L265e- 
11 128-151 PR001 990 9.09 
4.351e-ll 128-154 PR00199D 
5.65 3.641e-09 51-73 PR00199C 
13.84 9.571e-09 96-113 


iZlo 


T>T AAoO'l 


Antiexins repeat proteins domain proteins. 


BL00223A 15.59 1.000e-33 119- 
153 BL00223A15.591.435e-16 
47-81 BL00223C24.79 3,928e-15 
106-161 


1218 


PR00196 


ANNEXIN FAMILY SIGNATURE 


PR00196C 10.36 3.571e~25 123- 
145 PROOl96AlU6 7.300e-24 
Stw9 rKUUlVoB 10.68 4.808e-lo 
96-113 PR00I96A 11.16 6.236e- 
14 128-151 PR00I96E9.19 
1.000e-12 127-148 PR00196G 
lL72 5.829e-ll 171-185 


1218 


PR00201 


ANNEXIN TYPE V SIGNATURE 


PR00201D 10,49 8.729e-l4 123- 
145 PR00201A6.05 1.976e-13 
56-79 PR00201G 11.02 3.847e- 12 
127-154 PR00201A6.05 8.241 e- 
12 128-151 PR00201H 12.04 

A ftSOo 1 n I'll 1 ft<? 


1218 


PR00198 


ANNEXIN TYPE II SIGNATURE 


PR00198D7.65 7.787e-2l 123- 
145 PR0Q198B 8.71 3.880c-17 
56-79 PR00198C 14.32 2.68$e^ll 
96-113 PR00198G8.09Z033e-10 
127-148 


1218 


PR00200 


ANNEXIN TYPE IV SIGNATURE 


PR00200E 10.00 5.030e-19 123- 
145 PR00200G9.43 5.546e-14 
127-154 PR00200B 7.39 4,653e- 
11 128-151 PR00200B739 
4.857e-10 56-79 PR00200H 13.68 
9.663c-10 171-185 


1218 


PR00202 


ANNEXIN TYPE VI SIGNATURE 


PR00202D 5.58 6.793e-l4 123- 
145 PR00202G 8-01 5.545e-13 
127-154 PR00202Bll.44 2.782e- 
10 127-151 PR00202B 11.44 
5.206e-09 55-79 


1218 


PR00197 


ANNEXIN TYPE I SIGNATURE 


PR00197B 7.56 4,960e»29 56-79 
PR00197D 7.50 3.000e-26 123- 
145 PR00197A 8.68 7.577e-20 7- 
23 PR00197C7-50L0Q0e-I9 96- 
1 13 PR00197F 9.03 7.128e-10 
127-148 PR00197D 7,50 6.250e- 
09 51-73 
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1218 


PR00199 


ANNEXIN TYPE III SIGNATURE 


PR00199D 5.65 9.297e-17 123- 
145 PR00199B6.86 2.9I5e-13 
56-79 PR00199B6.86 1.265e-ll 
128-151 PR00199G9.09 4.351e- 
11 128-154 PR00199D 5.65 
3.641e-09 51-73 PRO0199C 13.84 
9.571C-09 96-113 


1221 


BL00086 


Cytochrome P450 cysteine heme-iron ligand 
proteins. 


BL00086 20.87 8«615e-27 423- 
455 


1221 


PR00465 


E-CLASS P450 GROUP IV SIGNATURE 


PR00465F 13.37 8.468e-12 393- 
412 


1221 


PR00359 


B-CLASS P450 SIGNATURE 


PR003591 11.13 Z261e-ll 433- 
445 


3221 


PR00463 


E-CLASS P450 GROUP I SIGNATURE 


PR004631 15.02 9.57 le-21 433- 
457 PR00463G 18.24 6,760e- 19 
388-413 PR00463E 17.37 6.595e- 
17 304-331 PR00463F 17.63 
7.56$e-1 2 347-366 PR00463B 
17.50 7.692e-12 79-101 
PR00463D 1402 8,875e-ll 284- 
302 PR00463C 12.85 6.932e-10 
171-190 


1221 


PR00464 


E-CLASS P450 GROUP II SIGNATURE 


PR00464G 12 41 2 588e~12 398- 
414 PR00464E 18.28 3,077e-10 
342-363 PR00464I 14.64 3.1 06e- 
10 433-457 PR00464H 13.32 
4.635e-09 420-434 PR00464D 
17.40 5,787e-09 313-331 
PR00464C 18.84 5.8O8e-09 2B4- 
313 


1221 


PR0O385 


P450 SUPERFAMILY SIGNATURE 


PR00385E 12.66 9.100e-14 433- 
445 PR00385A 14.97 5.696e-13 
295-313 PR00385B 10.22 6.400e- 
09 313-327 


1221 


PR00408 


MITOCHONDRIAL P450 SIGNATURE 


PR00408D 15.44 &831e-09 295- 
313 


1222 


PR00385 


P450 SUPERFAMILY SIGNATURE 


PR00385A 14,97 5.696e-13 295- 
313 


1222 


PR00463 


E-CLASS P450 GROUP I SIGNATURE 


PR00463B 17,50 7.692e^l2 79- 
101 PR00463D 14.02 8.875e-ll 
284-302 PR00463C 12.85 6.932e- 
10 17M90 


1222 


PR00464 


E-CLASS P450 GROUP II SIGNATURE 


PR00464C 18.84 5,808e-09 284- 
313 


1222 


PR00408 


MITOCHONDRIAL P450 SIGNATURE 


PR00408D 1 5.44 6.831e-09 295- 
313 


1223 


BL00477 


Alpha-2-macroglobulin family thiolester 
region proteins. 


BL00477A 13.50 9J82c-19 70-99 


1225 


BL00500 


Thymosin beta-4 family proteins. 


BL00500 9.77 2.565e-28 2-42 


1227 


PR00320 


G-PROTEIN BETA WD-40 REPEAT 
SIGNATURE 


PR00320A 16.74 4.97Ie-10 231- 
246 PR00320C 13.01 8,200e-I0 
231-246 PR00320B 12,19 9.486e- 
10 231-246 PR00320B 12.19 
3.475e-09 188-203 PR0032OB 
12.19 4.600e-09 315-330 
PR00320C 13,01 4.900e-09 315- 
330 


1227 


PR00319 


BETA G-PROTEIN (TRANSDUCIN) 


PR00319B 1 1.47 9.143e-09 315- 
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JtVvSUICS 










1227 


BL00678 


Trp-Asp (WD) repeat proteins prjteins. 


BL00678 9.67 1.600e- 10 233-244 

t3i /inA7w o fn 1 nnn*i nc r x\n ioq. 
dLuuu/o 7.0/ l.uuue-uo o l /-jao 


1236 


PF00580 


UvrD/REP helicase. | 


PF00580D 13.15 8.920e-l3 670- 
867-886 PF00580F 8.62 9.438e- 

1 A QTl^QOfi 

1 u y 1 j-yzo 


1237 


PR000I9 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019B 11.36 L000e-09 64-78 
PR00019A 1 1.19 8,000e-09 90- 
104 


1238 




rKU 1 blM KhrhA 1 UUMAIJN lrK 
NUCLEA, 


rDUUlzoA 21,3$ j.jULie-10 229- 

250 


1243 


DM01206 


CORONAV1RUS NUCLEOCAPSID 
FROTEM 


DM01206B 10.69 4.759e-09 464- 
484 


1243 


BL00315 


Dehydrins proteins. 


BLO0315A 935 1.000e-08 389- 
417 


1245 


BL00972 


Ubiquitin carboxyl -terminal hydrolases 
family 2 proteins. 


BL00972A 11.93 8,054ol5 191- 
209 


1246 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 9.500e-12 347- 
363 PF00023A 16.03 8.500e-10 
283-299 PF00023A 16,03 8.875e- 
10 184-200 


1246 


PF00791 


Domain present in ZO-1 and Unc5-Uke netrin 
receptors. 


PF00791B 28.49 1.989e-13 217- 
212 PF00791B 28.49 6,044e~13 
117-172 PF00791B 28.49 4.31 6e- 
12 184-239 PF00791B 28,49 
9,432e-l 2 250-305 PF00791B 
28.49 6.243e~l 0 84-139 
PF00791C 20.98 4.971e-09 98- 
137 


1246 


BL00906 


Uroporphyrinogen decarboxylase proteins. 


BL00906D 24.33 7J50e-09 212- 
256 


1248 


BL00415 


Synapsins proteins. 


D ¥ AA/I 1 Cf\ O Of> O OfVT™ Af\ IT A n 

BLUU415i^ o.Zy7e-U9 li-49 


1250 


oLUl 1 13 


Clq domain proteins. 


BLU1 1 lib lfi.2o ADVUe-lJ o41- 

o / / 


1252 


BL01248 


Laminin-type EGF-Hke (LE) domain proteins. 


BL01248 11.02 7.l71e-12 258- 
271 BL0I248 11.02 7.943 e- 12 

3ZJ-JJO 


1252 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011D 14.03 7.000e-l7 376- 
395 PR00011A 14.06 L000e-14 
"57^^95 PR(Wi1 1ft H OR 5 167p- 

J /XJ^Jj'tJ LlWJKJUllMD U,UO J.1U/C- 

14 376-395 PR0001 1C 24.25 
8.468e-l 4 395-424 PR00011D 
14.03 9J39e-09 249-268 


low 


JjJLaM/JOt- 


jcijujom? pruicins. 




1253 


PR00148 

X IVUu ITU 


ENOLASE SIGNATURE 


PR00148A 10.11 1.783e-18 44-59 


1255 


BL01153 


NOLl/NOP2/sun family proteins. 


BL01153D 19.69 8.322e-l4 102- 
128 BL01153C I3.676.507e-10 
51-65 


1256 


BL00892 


HIT family proteins. 

* 


BL00892B 16.86 1.000e-20 130- 
154 BL00892A 18.17 6.657e-20 
64-95 


1256 


PR00332 


HISTIDINE TRIAD FAMILY SIGNATURE 


PR00332B 13.62 3.000e-16 76-95 
PR00332C 7.37 4.600e-14 143- 
154 PR00332A 10J5 7.375e-12 
55-72 


1257 


PF00791 


Domain present in ZO-1 and Unc5-like netrin 


PF00791B 28.49 4.146e-10 73- 
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receptors. ^ 


128 


1258 


BL00615 


C-type lectin domain proteins. 


BL00615B 12.25 5.200e- 12 166- 
180 


1259 


BL00071 


proteins. • 


HI nAfY7i Q n 1 nt\ t AAn« /in on 
dLVvv/ lt> zl./U i.UUUe-40 80- 

126 BL00071C 11.81 1.000e-40 

140-1 ol JbLAJUU/lL) \y.5y 

3.ll8e-25 184-239 BL00071E 

11.48 4.600e-24 308-329 

BL00071A 5.81 2.607e-14 5-17 


1259 


PR0007X 

A EWJv\J / O 


DEHYDROGENASE SIGNATURE 


PR00O/8B 7.45 3.250e-24 146- 
1 65 PR00078D 1 1 .49 2.800e-2 1 
23N249 PR00078E 10.50 Alle- 
le 271-287 PR00078A 10.38 
1.000e-15 111-125 PR00078C 
i5$y 6.21 Ie-I I 173-190 


1262 


PR00926 


MITOCHONDRIAL CARRIER PROTEIN 
SIGNATURE 


PR00926F 17.75 2.688e-10 15-38 
PR00926D 10.53 6.625e-10 21-40 


1262 


PR00927 


ADENINE NUCLEOTIDE 
TRANSLOCATOR I SIGNATURE 


PR00927E 14.93 6.l43e-10 44-66 
PR00927B 14.66 9.870e-10 265- 
287 PR00927B 14.66 5.6 85e-09 
46-68 


1262 


BL00215 


Mitochondrial energy transfer proteins. 


BL00215A 15.82 6.250e-17 13-38 
BL00215A 15.82 1 ,600e- 1 5 230- 
255 BL00215A 15.82 5.974e-13 
108-133 BL00215B 10.44 7.600e- 
09275-288 


1263 

- 


PR00654 


ANGIOTENSINOGEN SIGNATURE 


PR00654A 15,64 1.540e-26 23-44 
PR00654D 10.48 3.538e-26 153- 
175 PR00654F 15.16 8.07 le-26 
255~275 PR00654E 9.81 2.241 e~ 
25 194-215 PR00654C9.50 
*>,DUUe-2i IID-IJS 


1263 


BL00284 


Serpins proteins. 


BL00284C 28.56 9.514e-2i 254- 
439-464 BL00284A 15.64 

8 147p-14 1 IS-1^7 RT fMY?iMn 
o. it /t. 1 * if 1 l,7 M ljf 

16,34 1.837e-12 361-388 
BL00284B 17,99 7.500e-ll 229- 
250 


1264 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20 89 8 071^-17 34 ^7 


1265 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20 89 7 600e-16 34-S7 


1266 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 




1267 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A ^0 89 9 400e-1fi 34- S7 


1268 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 

* , _ . _ .,. 


BL00290B 13.17 4.000e-21 282- 
300 BL00290A 20.89 4.600e-l6 
34-57 BL00290A 20.89 2.42 le- 
13 225-248 


1269 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 4.600e-16 34-57 


1271 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 8,071e-l7 34-57 


1272 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 4.600e-I6 34-57 


1273 


BL00290 


Immunoglobulins and major 


BL00290A 20.89 4.600e~l6 34-57 
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histocompatibility complex proteins. 




1274 


kPF00651 


TVTR /fltQn Itntru/n a<t UR-P/TtlA Hrtmain 

*-> * 4.' \ai;>v twiiuwu do v/ l ityj UOIIlcUIl 

proteins. 




1274 


&M00099 


4 kw A55R REDUCTASE TFRMVWAT 
DIHYDROPTERIDINE. 


310 


1274 


J. IWVJUl 


KFT fH RFPPAT WiNATf FRF 


PPfl/Wftl A 7< 7 7^n* 1 1 All 

486 PR0050 1 A 8.25 7.9556-09 

J.ZO*-J*fZ 


1281 


BL00972 


Ubiquitin carboxyl-termtnal hydrolases 

ffimilv 0 nmtpin<i 

ICllllJIJ £i JJiULC/lIta. 


BL00972A 11.93 3.919e-15 101- 

1 1 0 FlI AnG79G O A< 7 ?77a 1 ft 

180-190 


1285 


PF00789 


Domain present in ubiquitin-regulatory 
proteins. 


PF00789B 19.70 5.941 e-09 213- 
234 


1286 


PF00789 


Domain present in ubiquitin-regulatory 
proteins. 


PF00789B 19.70 5.941e-09 259- 
280 


1287 


PR00625 


DNAJ PROTEIN FAMILY SIGNATURE 


PR00625A 12.84 3.000e-19 19-39 
PR00625B 13.48 2.756e-17 47-68 


1287 


BL00636 


NNdnaJ domain proteins. 


BL00636A 8.07 7.600e-19 23-40 
BL00636B 15.1 1 6.870e-15 47-68 


1288 


BL00518 


Zinc finger, C3HC4 type (RING finger), 
proteins. 


BL00518 12.23 5.714e-10 24^33 


1289 


PR00500 


POLYCYSTIC KIDNEY DISEASE 
PROTEIN SIGNATURE 


PR00500I9.22 U07e-31 2810- 
2833 PR00500G 3.68 I.087e-30 
2525-2548 PR00500H 17.80 
U07e-29 2662-2684 PR00500E 
6.99 1.106e~27 2350-2370 
PR00500F 9.44 1 .108e-26 2483- 
2503 




rrUUSUI 


PK.D domain proteins. 


PF00801B 23,63 9.2l7e-26 1055- 
1083 PF008O1A 13.49 6.276c-ll 
222-235 PF00801B 23.63 3.0870- 
10 719-747 PF00801B 23.63 
o.ouye-iu iojz-ioou 


1291 


BL00415 


Synapsins proteins. 


BL00415N 4.29 5,401e-09 136* 
Ian 


1292 


PD00930 


PROTEIN GTPASE DOMAIN 
ACTIVATION. 


PD00930B 33.72 2.800e-23 229- 
270 PD00930A 25,62 5.021e-l2 
125-151 


1292 


PF00620 


GTPase-activator protein For Rho-iike 
GTPases. 


PF00620B 14.20 7.000e-12 178- 
195 


1293 




TvnP: FT flhrnnpftin llflcrpn-HinH ma rlmnnirt 
* /p* ii iiui uiiCLiin \^yiia^tiJi"Uiiiuiiig uuinctin 

proteins. 


318 BL00023 24.31 5.320e-32 
223-260 BL00023 24.31 5.800e- 
29 340-377 


1293 


BL00I42 


Neutral zinc nietallopeptldases, zinc-binding 
region proteins. 


BL00142 $.38 1.000e~I2 398-409 


1293 


PR00138 


MATRIXIN SIGNATIIRF 


424 PR00138C 16.41 1.000e-29 
164-193 PR00I38B 15.82 5.875c- 
16141-157 PR00138A 15.14 
1.000e-15 94-108 PR00138E6.01 
8.472e-l 1431-445 


1293 


BL00024 


Hemopexin domain proteins. 


BL00024C 22.98 1.000e-40 163- 
212 BL00024D 17.28 4.316C-36 
392-424 BL00024B 21.53 7.545e- 
33 115-149 BL00024F 11.30 
2.895e-l 8 486-507 BL00024A 
11.49 3-667e-l 2 94-105 
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BL00024G 13.31 4.857e-12 525- 
538 BL00024E7.58 2.263e-10 
431-445 


1293 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR00480B 15.41 L771e-09 393- 
412 


1293 


PR00013 


FIBRONECTIN TYPE II REPEAT 
SIGNATURE 


PR00013C 12.29 7, 1 3 6e- 19 372- 
388 PR00013C 12.29 6.464e-l8 
313-329 PR00013C 12.29 5.5O0e^ 
16 255-271 PR00013B 14.75 
4.375e-12 355-368 PR00013A 
12.26 L973e-ll 344-354 
PR00013A 12.26 6.595e-ll 227- 
237 PR00013B 14.75 8.568e-10 
296-309 PR00013A 12.26 2.167e~ 
09 285-295 PR00013B 14.75 
2.588e-09 238-251 


1293 


BL00546 


Matrixins cysteine switch. 


BL00546B 20,1 1 3.368e-40 164- 
208 BL00546C 16,41 6.400e-31 
392-424 BL00546A 19*62 
6.523e-19 74-104 BL00546E 
10.23 7.947e-l 3 486-507 
BL00546F 12.40 5.339e-09 525- 
538 


1294 


BL00023 


Type II fibronectin collagen-binding domain 
proteins. 


BL00023 24.31 8.043e-34 281- 
318 BL00023 24.31 5.320e-32 
223-260 BL00023 24.31 5.800e- 
29 340-377 


1294 


BL00546 


Matrixins cysteine switch. 


BL00546B 20.1 1 3.368e-40 164- 
208 BL00546C 16.41 6.400e-31 
392-424 BL00546A 19.62 
6.523e-19 74-104 




r>r> t\Pi tlo 


MATRIXIN SIGNATURE 


PR00138D 16.56 5.500e-30 398- 
424 PR00138C 16.41 1.000e-29 
164-193 PR00138B 15.82 5.875e- 
16141-157 PR00138A 15.14 
1.000e-l 5 94-108 


1294 


BL00142 


Neutral zinc metal lopeptidases, zinc-binding 
region proteins. 


BL00142 8.38 1.000e-12 398-409 


1294 


BL00024 


Hemopexin domain proteins. 


BL00024C 22.98 1.0006-40 163- 
212 BL00024D 17,28 4.31 6e-36 
392-424 BL00024B 21.53 7.545e- 
33 115-149 BL00024A 11.49 
3.667e-l 2 94-105 


1294 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR00480B 15,41 1.771e-09 393- 
412 


1294 


PR00013 


FIBRONECTIN TYPE II REPEAT 
SIGNATURF 


PR00013C 12.29 7.1 36e- 19 372- 
3Rft PRrtnfn \c to oo a A&Ae> 1 ft 

joq r tSXfvvl jl< I A,£y O.'J-O'+e-l t> 

313-329 PR00013C 12.29 5.500e- 
16 255-271 PR00013B 14.75 
4.375e-12 355-368 PR00013A 
12.26 1.973e-Il 344-354 
PR00Q13A 12.26 6.595e-ll 227- 
237 PR00013B 14.75 8.568e- 10 
296-309 PR00013A 12.26 2,l67e- 
09 285*295 PR00013B 14.75 
2.588e-09 238-251 


1298 


DM01354 


kw TRANSCRIPTASE REVERSE II ORF2. 


DM01354R 8.50 2.969e-22 2115- 
2145 DM01354S 11.61 L692e-I4 
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2145-21 66* 


1298 


DM01688 


2 POL Y4G RECEPTOR. 


DM01688IJ 13.44 8.244e-09 
1714-17371 




JUIVIUUI /V 


VIM A CP AT PUA ArVUTECTrVM T f'CT T 


UMUUI /y fuMI 3. /3/e-IU Io07- 
1817 DM0§1 79 13.97 7.1 5Se-10 
10774087 DM00179 13.97 
9.053e4 0 759-769 DM00179 
13,97 9,053e-10 1328-1338 
DM00179 13.97 4.130e-09 574- 
584 DM00179 13.97 4,130e-09 
1431-1441 DM00179 13.97 
o.8/0e»uy 17134723 DM00 1 79 
13.97 7.652e-09 850-860 
DM00179 13.97 8.435e-09 2089- 


1298 


BL00240 


Receptor tyrosine kinase class III proteins. 


BL00240B 24.70 S.909e43 623- 
647 BL00240B 24,70 1.209e40 
1126-1150 BL00240B 24.70 
4.558e-10 124-148 BL00240B 
24.70 6.442e-l 0 529-553 
BL00240B 24.70 4.255e-09 1222- 
1246 BL00240B 24.70 8.468e-09 
995-1019 


1298 


PD02327 


GLYCOPROTEIN ANTIGEN PRECURSOR 
IMMUNOGLO. 


PD02327B 19.84 8.091e-09 1126- 
1148 PD02327B 19.84 9.31 8e-09 
1222-1244 


1298 


PD02870 


RECEPTOR INrERLEUKIN-I 
PRECURSOR. 


PD02870B 18,83 L200e40 1610- 
1643 PD02870B 18.83 7.400e40 
2081-2114 PD02870B 18.83 
7. 800e40 10694102 PD02870B 
18.83 5.21 3e-09 14234456 
PD02870B 18.83 6>649e-09 67- 

15184551 PD02870D 15.74 
8.564e-09 566-601 PD02870B 
18.83 9.52le-09 286-319 

PD07R7flB 1ft 5tt Q QOAt* 00 19^fi 
i uvj£.o i \jl> io, qj y,y\j*TV"Kjy ijlDo" 

1291 


1299 


BL00888 


Cyclic nucleotide- binding domain proteins. 


BL00888B 14.79 4.706e48 372- 
396 BLO0888A 18.03 L000e-08 
354-371 


1301 


PF00615 


Regulator of G protein signalling domain 


PF00615B 16.25 9.625e46 73-90 
PF0061 10 Ofi Q ?nfif-19 1 *n 
164 


1302 


BL00766 


Tetrahydrofolate 

dehydrogenasc/cyclohydrolase proteins* 


BL00766E 13.78 9.62Se-39 191- 
228 BL00766C 25.86 4.375e-31 
77-125 BL00766D 17 05 5 966e- 
25 152-182 


1302 


PR00O85 


TETRAHYDROFOLATE 
DEHYDROGENASE/CYCLOHYDROLASE 
FAMILY SIGNATURE 


PR00085E 15.79 7.000e-26 151- 
181 PR00085G 10.74 1.865e-22 
208-227 PR0008£C 15.23 6.182e- 
2147-69 PR00085D 15.02 
2.688e-20 924 13 PR00085F 9.77 
6,595e45 19N208 


1303 


BL00180 


Glutamine synthetase proteins. 


BL00180E 17.60 1.00Oe-4O 154- 
206 BL001 SOD 1 3.26 2. 1 74e-24 
119441 BL00180F 10.05 6.211e- 
17 218-231 BL00180G 10.20 
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e 


8.43 5e-l 7 307-322 BL00I80C 
12.14 4 600e-14 102-112 
BL00180B 18.03 4.971e-14 68-87 
BL00180A 13.20 5.065e-14 32-45 


1304 


BLOOI80 


Glutsixiinc synthetase proteffcis. 


BI 001 RDF 10 OS fi 7Sfip-t 40^7 


1306 


BL01131 


Ribosomal RNA adenine dimethylases 
proteins. 


BL01 131 A 26.62 1.000e-08 77- 
123 


1308 


DM0019I 


w SPAC8A4.04C RESISTANCE 
SPAC8A4.05C DAUNORUBICIN, 


DM00191A 8.16 5.440e-09 36-49 


1309 


DM00191 


w SPAC8A4.04C RESISTANCE 
SPAC8A4.05C DAUNORUBICIN, 


DM00191A 8.16 5.440e~09 61-74 


1310 


PFOO606 


Herpesviral Glycoprotein B. 


PF00606I 20.74 7.894e-09 264- 
316 


1310 


BL01219 


Ammonium transporters proteins. 


BL01219D 1 1.63 2.957e-10 217- 
241 BL01219F 15.24 8.809e-09 
335^360 


li i\) 


JrKUUi4Z 


T1T_TT" , C , T TC 1 T>T f\f\T\ /"llTi /*vf f T\ hn atp t> t 

RHESUS BLOOD GROUP PROTEIN 
SIGNATURE 


PR00342G 8.18 L458e-19 220- 
239 PR00342B 11.09 8.657e-l3 
61-79 PR00342D 8.46 2.857e-12 

1 ^\ €\ t A Tin ini a /\ ******* 

1 28*1 45 PR003421 4.99 6.0 1 6e- 
12 285-299 PR00342H7.61 
O.yz/e-1 1 zjU-2/3 FR00342C 
10.10 4.770e-10 90-108 

175 PR00342F7.02 1.556e-09 

1ft*I_7Al PP OCYXAO f R O'? *7 Q/IA Q 

ioj-zui rivuuj'tiij o.v/ /.y^ue- 
09 308-327 PR00342L 7.61 
9.600e-09 398-424 


1311 


PR00209 


ALPHA/BETA GLIADIN FAMILY 
SIGNATURE 


pijfifi^noTi a ftft o asa^i i en oo 

rIVvl/jtU7£) t.OO y.l/OuvJ'-i 1 Ov"77 

PR00209B 4.88 6.967e-10 86-105 


1311 


DM00406 


GLIADIN. 


DM00406 7.73 1.400e-09 86-99 


1311 


PR 00501 


KFI CH RFPFAT STfrNATl FRF 


PRAA^AIR 1 ft lift ft lylOfi AO. /l/*A 

rKIflDU lis J d.oo o.j*f^e-uy 440- 
455 


1312 


PR00528 


GLUCOCORTICOID RECEPTOR 
SIGNATURE 


PR00528F 9.13 9.063e-09 31-51 


1313 


PF00622 


Domain in SPla and the RYanodine Receptor. 


PF00622C 12.62 6.625e-13 759- 

771 


1313 


BL0051S 


Zinc finger, C3HC4 type (RING finger), 
proteins. 


BL005I8 12,23 6.571e-10 30-39 
Rl 00^1 R 17 7^ 1 A7V7(* AO 7^<5 

365 


1314 


BLOO420 


Speract receptor repeat proteins domain 
proteins. 


BL00420B 22 67 7 S'Mp-TS 17 09 
BL00420C 11.90 9.250e~ 12 122- 
133 


1314 


PR00258 


SPERACT RECEPTOR SIGNATURF 


PRAfl2SRD 14 4.1 317** 1 1 Ofi 

113 PR00258B9.63 7.474e-ll 
52-64 PR00258E 13.33 1.750e-09 
121434 PR00258C 9.05 5.167c- 
09 67-78 


1315 


PR00080 


ALCOHOL DEHYDROGENASE 
SUPERFAMILY SIGNATURE 


PR00080A 9.32 8.548e-10 122- 
134 


1315 


BL00766 


Tetrahydrofoiate * 
dehydrogenase/cyclohydrolase proteins. 


BL00766C 25.86 7.632e-09 20-68 


1315 


PR00081 


GLUCOSE/RIBITOL DEHYDROGENASE 
FAMILY SIGNATURE 


PR00081A 10.53 2,452e-I3 41-59 
PR00081C 15.13 9.229e~09 167- 
184 


1317 


BL00263 


Natriuretic peptides proteins. 


BL00263 11.87 5.909e-22 129- 
147 
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1317 


PR00711 


ATRIAL NATRIURETIC PEPTIDE 
SIGNATURE 

1 


PR00711G 11.75 1.113e-30 128- 
151 PR00711B 10.71 7,545e«24 
32-51 PR00711D7.91 LOOOe-22 
72-90 PR007UF3,V7 2.463e-22 
109-128 PR0071 1C 5.6(5 7,81 8e- 
22 51-70 PR00711E6.39 LOOOe- 
21 92-109 PR00711A 12,00 
9,769e-20 11-30 


1317 


PR00713 


OTYPE NATRIURETIC PEPTIDE 
SIGNATURE 


PR00713C 14.14 L370e-13 130- 
146 


1317 


PR00710 


NATRIURETIC PEPTIDE FAMILY 
SIGNATURE 


PR00710A 10,90 3.250e- 14 127- 
137 PROO710B 11.08 L391e-12 
136-146 


1317 


PR00712 


BRAIN NATRIURETIC PEPTIDE 
SIGNATURE 


PR00712D 10.52 4.109e-12 128- 
139 FR00712E 10.62 7.231 e- 10 
138-152 


1318 


BL00609 


Glycosyl hydrolases family 32 proteins* 


BL00609C 13.27 9.270e- 11 249- 
261 


1318 


BL0I187 


Calcium-binding EGF-like domain proteins 
pattern proteins, 


BL01187B 12.04 6.53$e-16 757- 
773 Bl 01 187R 1? 04 7 7StV-14 
610-626 BL01187B 12.04 8.200e- 
14 651-667 BL01187B 12.04 
2 029e-10 571-539 RI01IR7A 
9.98 7.429e-10 591-603 


1318 


PR00907 


THROMBOMODULIN SIGNATURE 


PR00907B 11.29 6.301e-ll 753- 
770 PR00907B 11.29 2.636e-10 
647-664 PR00907B 11.29 3.524e- 
09 519-536 PR00907G 11.63 
4.243e-09 651-678 


1318 


BL01177 


Anaphylatoxin domain proteins. 


BL01 177C 17.39 8.286e-09 517- 
536 


1318 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 11.16 1.429e-09 762- 
773 PR00010C 11.16 8.500c-09 
528-539 


1318 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 LOOOe-08 619- 
626 


1319 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY 
SIGNATURE 


PR00237E 13.03 4.000e-l0 26-50 


1320 


BL00125 


Serine/threonine specific protein 
phosphatases proteins. 


BL00125D 33.1 1 9.719e-35 23-78 


1320 


PR00114 


SERINE/THREONINE PHOSPHATASE 
FAMILY SIGNATURE 


PR001I4F 17.51 4.706e- 16 39-60 
PR00114G 17.20 5.421e-12 61-78 


1321 


BL00453 


FKBP-type peptidyl-prolyl cis-trans 
isomeraso proteins. 


BL00453B 23,86 6,538e-26 281- 
315 BL00453A 15.57 8.364e-12 
249-264 BL00453C 9.72 3.250e- 
11 323-336 


1321 


PR00280 


CHANNEL FORMING COLICIN 
SIGNATURE 


PR00280A 1 1.09 8.227e-09 284- 
300 


1322 


PR00497 


NEUTROPHIL CYTOSOL FACTOR P40 
SIGNATURE 


PR00497A6.92 8.261e-09 310^ 
328 


1322 


BL50002 


Src homology 3 (SH3) domain proteins 
profile. 


BL50002B 15,18 9.500c-09 45*59 


1323 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 8.269e-16 34^56 


1323 


BL011I5 


GTP-bmding nuclear protein ran proteins. 


BL01115A 10.22 2.474e-09 34-78 


1323 


PR00300 


AIT-DEPENDENT CLP PROTEASE ATP- 
BINDING SUBUNIT SIGNATURE 


PR00300A 9.56 6.260e-09 36-55 
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jrnubpiioiiDuioKinase proteins. 


Ui rtrtc^TA ia r f n i aa. Art ne ca 

J3LUU567A 10.66 9.1O0e-09 35-54 


1324 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12,27 9.847e-10 314- 
333 




Jr Jr UU-t4Z 


DNA polymerase (viral) N-terminat domain 
proteins. 


PF00242G 13.52 6.276e-09 748- 
788 


I. J/-M- 


or Ant 0*7 


Protein kinases ATP -binding region proteins. 


BL00107B 1331 L000e-ll 381- 
397 BL00107A 18.39 8.091 e-09 
314-345 




t>t> nr\ t no 


1 YKUMNb KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12,27 9.847e-10 314- 
333 


1325 


PF00242 


DNA polymerase (viral) N-terminal domain 
proteins. 


PF00242G 13,52 6.276e-09 721- 
761 


1325 


BL00107 


Protein kinases ATP-binding region proteins. 


BL00107B 13.31 L000e-ll 381- 
397 BL00107A 18.39 8.091e-09 
314-345 


1326 


BL00472 


Small cytokines (intercrine/chemokine) C-C 
subfamily signatur. 


BL00472C 20.76 8.225e-09 50-87 


1327 


PR00705 


PAPAIN CYSTEINE PROTEASE (CI) 
FAMILY SIGNATURE 


PR00705A 10.55 8.667e-13 114- 
130 PR00705B 10.22 2.385e-10 
293-304 


1327 


BL00139 


Eukaryotic thiol (cysteine) proteases cysteine 
proteins. 


BL00139D 9.24 8. 125e-17 312- 
329 BL00139C9,23 2.800e-10 
292-302 BL00139B 10,19 7.600e- 
10157-166 BL00139A 10,29 
2.723e-09 114-124 


1328 


PR00705 


PAPAIN CYSTEINE PROTEASE (CI) 
FAMILY SIGNATURE 


PR00705A 10.55 8.667e-l3 155- 
171 PRO07O5B 10.22 2.385&-10 
334-345 


1328 


BL00139 


Eukaryotic thiol (cysteine) proteases cysteine 
proteins. 


BL00139D 9.24 8.125e-17 353« 
370 BL00139C 9.23 2.800e-10 
333-343 BL00139B 10.19 7.600e- 
10 198-207 BL00139A 10.29 
2.723e-09 155-165 


1330 


PD01270 


RECEPTOR FC IMMUNOGLOBULIN 
AFFIN. 


PD01270A 17.22 7.443e-10 129- 
169 PD01270A 17.22 7,387e-09 
36-76 


1332 


TIT AA J 1 ^ 

BL00412 


Neurornodulin (GAP-43) proteins. 


BL00412D 16.54 6.772e-10 250- 
301 


1332 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 2.068e-09 751- 
784 


1333 


BL00232 


Cadherins extracellular repeat proteins 
domain proteins. 


BL00232B 32.79 8.594e-35 151- 
199 BL00232B 32.79 5.579e~22 
260-308 BL00232A 27.72 
I .OOOc-20 57-90 BL00232C 10.65 
J.olJe- 14 258-276 BL00232B 
32 79 4 872e-l 1 3 77-49 ^ 
BL00232C 10.65 3.21 le-09 480^ 
498 


1333 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL-LL 


DM0 1724 8.14 9.1 13e-l0 698-7 1 8 
DM0I724 8.14 6.803e-09 694-714 


t833 


PR00205 


CADHERIN SIGNATURE 


PR00205B 1 1.39 4.545e-15 258- 
276 PR00205A 14.73 5.600e-09 
183-199 PR00205B 1 1.39 8.017e* 
09 480-498 


1335 


BL00214 


Cytosolic fatty-acid binding proteins. 


BL00214B 26.51 9.000e-29 47-92 
BL00214A 21,17 1.000e-24 6-32 


1335 


PR00178 


FATTY ACID-BINDING PROTEIN 


PR00178C 20.54 3.864C-25 65-93 
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SIGNATURE 


PR00178A 15.07 7.l88e-23 7-2§ 
PR00178D 13.52 6.170e-12 111- 
130 1 


1336 


PR00452 


SH3 DOMAIN SIGNATURE 


PR0O452B 1 1 65 8 250p-09 SOQ-ll 
525 | 


1338 


BL00972 


Ubiquitin carboxyl-terminal hydrolases 
family 2 proteins. 


BL00972A 1 1,93 8.759e-17 1 I2- w 
130 BLO0972D 22.55 8.1 16e- 12 
354-379 BL00972B 9.45 7.088e- 
09 193-203 


1340 


BL00243 


Integrins beta chain cysteine-rich domain 
proteins. 


BL002431 31.77 3.l55e-09 1-44 


1340 


PR00633 


CHROMOSOME CONDENSATION 
"RFfiTIT A TOT? ^I<TNATln?P 


PR00633E 12.18 4.682e-10 182- 
lyy rKUUoJJO Ii.71 Ko67e-09 

is*; o(\a PRnn^Tirr k ?ai q^iq 
ioj-£\J t t rivuuojjri j j. iu j.yoJe* 

09 244-266 


1340 


BL00625 


Regulator of chromosome condensation 
(RCC1) proteins. 


BL00625B 17.69 5.219e-15 1 79- 
213 BL00625B 17.69 9.1 94e- 14 

1*7*7 OI AfWKA i tL *ti 

/ / 1JLUUOZ5A 10.21 

16.21 5.500e-l 2 129-158 
BL00625A 16.21 7.203e-12 349- 
378 BL00625B 17.69 5.778e- 10 

193-1^7 RT nnA9*TO 1 7 AO < f\1A* 

09285-319 


1342 


BL00476 


Fattv acid desaturase^ familv 1 nrntpirK 




1345 


PR00320 


G-PROTEIN BETA WD-40 REPEAT 
SIGNATURE 


PR00320A 16.74 9.690e- 11 292^ 
307 PR00320B 12.19 4.343e~ 10 
292-307 PR00320C 13.01 7.840e- 


1345 


BL00678 


Trp-Asp (WD) repeat proteins proteins. 


BL00678 9.67 1.000e-09 294-305 


1345 


BL00225 


Crvstallin^ beta, and panrtma 'fJreelr Itpv' mntiF 
proteins. 


RT 0099 ^R 1J3 ftA 7 77T*» 99 A ^11 

BL0022SB 18.06 5.673e44 97- 
1 32 RT 0099SA 1 3 Jt9 7 9 1 5te AO 
61-82 


1350 


PD01823 


PROTEIN INTERGENIC REGION ABC1 
PRECURSOR MITOCHONDRION T. 


PD01823D 16 66 3 093e-l5 91^19 
PD01823E 9.30 5.909e-15 75-88 


1352 


BL00540 


Ferritin iron-binding regions proteins. 


BL00540A 15.06 l.000e-40 9-50 
BL00540B 18.82 L000e-40 100- 
155 BL00540C 13.00 7.500e- 15 
165-177 


1353 


PR00294 


STREPTOMYCBS SUBTIUSIN 
INHIBITOR SIGNATURE 


PR00294A 10.44 6.444e-10 159- 
186 


1353 


DM00191 


w SPAC8A4.04C RESISTANCE 
SPAC8A4.05C DAUNORUBICIN. 


DM00191D 13 94 7.167e-10 206- 
245 


1356 


BL00428 


Cell cycle proteins ftsW / rodA / spoVE 
proteins. 


BL00428A 14.30 3,613e-09 91- 
110 


1359 


BL00142 


Neutral zinc mctallopeptidases, zinc-binding 
region proteins. 


BL00142 8.38 7.18Se-tO 389-400 


1359 


PD01719 


PRECURSOR GLYCOPROTEIN SIGNAL 
RE, 


PD01719A 12.89 7.983e-16 550- 
578 PD01719B 9.30 1.750e-09 
877-885 PD0I719A 12.89 
3,000e-09 1006-1034 


1359 


PR00480 


ASTACIN FAMILY SIGNATURE 


PR0048OB 15.41 3.186e~09 384- 
403 


1360 


DM00191 


w SPAC8A4.04C RESISTANCE 
SPAC8A4.05C DAUNORUBICIN, 


DM00191D 13,946.330e-ll 232- 
271 DM00191D 13.94 7.728e-ll 
48-87 DM00191D 13.94 5.000e- 
10 1 12 151 DM00191D 13.94 



282 



WO 02/070539 



PCT/USO 2/05095 



SEQ ID 
NO: 


Database 
entry ID 


Description 


Results* 








5.667e-10 5$98 DM00191D 
13.94 5.667eU0 123-162 
DM00191D f .94 6.583e-I0 56- 
95 DM0019ft 13.94 8.417e-10 
280-319 DMjpi91D 13.94 
5.9176-10 192=231 DM00191D 
13.94 1.391e-09 224-263 
0MOOJ91D 13,94 2.409e~09 208- 
247 DM00191D 13.94 4.835e^)9 
120-159 DM00191D 13.94 
5.304e-09 149-188 DM00191D 
13.94 5.461e-09 21 1-250 
DM001 91D 13.94 6.322e-09 80- 
119 DM00191D 13.94 7.652e-09 
243-282 DM001 91 D 13.94 
8.5 13e-09 216-255 DM00191D 
13.94 9.452e-09 177-216 


1360 


PF00624 


Flocculin repeat proteins. 


PF00624J6.21 3.496e-ll 237-292 
PF00624J6.21 6.597e-U 53-108 
PF00624J6.21 4.121 e-10 253-308 
PF00624J6.21 5.7l8e-10 141-196 
PF00624F 1 1.04 1.508e-09 50-86 
PF00624J6.21 3.163e-09 101-156 
PF00624J6.21 3.233e-09 165-220 
PF006241 9.10 5.18le-09 140-170 
PF00624F 1 1.04 6,008e-09 130- 
166 PF00624J 6.21 6.093e-09 
125-180 PF00624J 6.21 6,163e- 
09 221-276 PF00624G 10,91 
6. 806e-09 45- 1 00 PF00624G 
10.91 7. 169e-09 181-236 
PrU0oZ4O 10.91 7.387e-09 221- 
276 PF00624J6.21 8.674e-09 
197-252 PF00624J6.21 8.884e- 
09117-172 PF00624J6.21 

6.21 9.512e-09 55-110 


1360 


BL001 15 


PnlifirVfiHr RTvIA nnlvmpra^p TT hf»ntm-*f»r*tiHf» 

1_>UHUL J Wills XVLNiT. HHjIcIjC IX liCJJlClUCL/lIUC 

repeat proteins. 


BL001 15Z 3.12 7.618e-09 36-85 
BL0O115Z3,12 9.6O3e-09 241- 
290 


1363 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 1.321 e^09 1 10- 
126 


1363 


PF00791 


Domain present in ZO-1 and Unc5-Iikc nctrin 
receptors. 


PF0079IB 28.49 7,527e-13 110- 
165 PF00791B 28.49 6.119e-09 
77-132 PF00791C 20.98 7.529e- 
09 91-130 


1366 


PF00168 


C2 domain proteins. 


PF00168C 27.49 9.250e-17 320- 
346 


1366 


PR00399 


SYNAPTOTAGMIN SIGNATURE 


PR00399A 9.52 U44e-14 148- 
164 PR00399C 12.82 8.071e-14 
218-234 PR00399& 14.27 2.853e- 
13 163 177 PR0O399D 14.48 
i.871e-ll 238-249 


1366 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 8.759e-12 337- 
351 PR00360A 14.59 3,903e-10 
308-321 PR00360B 13.61 4.789e- 
10 203-217 
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1367 


PF00168 


C2 domain proteins. 


PF00168C 27.49 9.250e-17 320- 


1367 


PR00399 


SVN^TOTAGMIN SIGNATURE 

! 


PR00399A 9.52 1.844c-14 148- 
164 PR00399C 12.82 8.071e-14 
218-234 PR00399B 14.27 2.853e- 
13 163-177 PR00399D 14.48 
1.871e-l 1 238-249 


1367 


PRO0360 


C2 DOMAIN SIGNATURE 


PR0036QB 13.61 8.759e-12 337- 
351 PR00360A 14.59 3.903e-10 
308-321 PR00360B 13.61 4.789e- 
10 203-217 


1368 


BL00086 


Cytochrome P450 cysteine heme-iron ligand 
proteins. 


BL00086 20.87 1.857e-20 444- 
476 


1368 


PR00464 


E-CLASS P450 GROUP II SIGNATURE 


PR00464I 14.64 4.375e-17 454- 
478 PR00464A20.47 9.59le«l6 
130-151 PR00464C 18.84 L.OOOe- 
15 305-334 PR00464D 17.40 
6.250e- 15 334-352 PR00464H 
13.32 8.941e-15 441-455 
PR00464F 15.23 9.654e-13 403^ 
419 PR00464B 20.41 l,844e-12 
186-205 PR00464E 18.28 7.907e* 
12 363-384 PR00464G 12.41 
8.412e-12 419-435 


i JOo 


nn (\f\ACc 
rKUU4oj 


Jb-CLASS P450 GROUP IV SIGNATURE 


PR00465H 17.76 6.586e-10 454- 
473 


1 

IjOo 




P450 SUPERFAMILY SIGNATURE 


PR00385A 14.97 l.346e-12 316- 
334 PR00385B 10.22 4,130e-l 1 
334-348 PR00385D 13.11 7.857e- 
10 445-455 PR00385E 12,66 
9.438e-l 0 454-466 


1368 


PR00463 


E-CLASS P450 GROUP I SIGNATURE 


PR00463G 18.24 3.605e-I4 409- 
4J4 r KUu463h 1 7.37 4. 8 1 4e** 1 1 
325-352 PR00463I 15.02 5.574e- 
09454-478 PR00463H 12,41 
7.158e-09 444-455 


1370 


BL00218 


Amino acid permeases proteins. 


BL00218D 21,49 9.757e-l 1 263- 
308 


1371 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 3.288e-09 35-50 


1372 


PR00380 


KINESIN HEAVY CHAIN SIGNATURE 


PR00380A 14.18 4.086e-22 84- 
106 PR003H0C 13.18 5.286e- 17 
240-259 PR00380D 9.93 7.698e- 

i f ZyU-312. rKUUi&UfcJ 12.04 
7.805e-l 4 207^225 


1372 


BL00411 


Kmesin motor domain proteins. 
* 


BL0041 1G 21.39 7.750e-25 241- 
283 BL00411C 15.04 2,500e-22 
84- 1 06 BT 004 11 H 1 ^ 66 Jt 

16289-320 BL00411E 10.43 
9.129e-I6 135-154 BL00411F 
1 4.77 9.795M6 198-223 
BL00411D 12.13 5.909e-09 1 14- 
125 


1373 


FR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 1.915e~09 590- 
605 


1373 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019A U.19 5.800C-12 158- 
172 PR000I9B 11.36 1.000e-10 
130-144 PR00019A 11.19 8.826e- 
10133-147 PR000I9B 11.36 
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7.120e-09 106-120 


*1373 


PR0050O 


POLYCYSTIC KIDNEY DISEASE 
PROTEIN SIGNATURE 


PR00500B 7.74 7,821 e-09 250^ 
271 


§374 
1 


BL00411 


Klnesln motor domain proteins. 


BL0041 1H 15.66 7.81 le-22 79- 
110 BL004 11 G 21.39 8.683e-22 
31-73 


1374 


PR0038O 


KINESIN HEAVY CHAIN SIGNATURE 


PR0038OC 13.18 2.385e-16 30-49 
PR00380D 9.93 3.739e-16 80*102 


1376 


BL00790 


Receptor tyrosine kinase class V proteins. 


Bl 00790F 70 fifi7e 1 9 7*V7 
815 


1376 


PR0001O 


TYPE II EGF-LIKE SIGNATURE 


PR00010C 11.16 5.636e-10 423- 
434 PR00010C 11,16 8.07te-09 
148-159 


1376 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 8.200e-09 427- 
434 


1376 


PR00907 


THROMBOMODULIN SIGNATURE 


PR00907B 1 1.29 7.312e-10 224^ 
241 PR00907G 11.63 5*297e~09 
62-89 PR00907B U.29 8354e-09 
98-115 PR00907B 11.29 9.451e- 
09334-351 


I j / 0 


■rt ni 1 st"i 

olwUl 10/ 


Calcium-binding EGF-like domain proteins 
paiiem proteins. 


BL01I87B 12,04 5.235e-15 62-78 
tJLOl 187B 12.1)4 5.7o5e-15 418- 
434 BL01187B 12.04 3.000e-12 

12 297-313 BL0I187B 12.04 

12.04 4.857e-10 378-394 

118 BL01187A9.98 6.571e-t0 
321-333 BL01187A9QJI S 17Sp- 
09 126-138 BL01187A9.98 
9.625e-09 362-374 


1377 


BL00048 


Protamine P 1 proteins. 


BL00048 6.39 4.038e-09 396-423 


1381 


BL00107 


Protein kinases ATP-blnding region proteins. 


BL00107A 18.39 8.500e-27 342- 
373 


1381 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109B 12.27 5,412e-12 342- 
361 


1381 


PD00306 


PROTEIN GLYCOPROTEIN PRECURSOR 
RE. 


PD00306A 10.26 6.143e-09 25^39 


1382 


PR0091O 


LUTEOVIRUS ORF6 PROTEIN 
SIGNATURE 


PR00910A 2.51 6.036e-09 48-61 


1388 


PR000I9 


LEUCINE-RICH REPEAT SIGNATURE 


PR0O0 19A 1 1.19 7.26 1 e- 1 0 69-83 
PR00019B 11.36 4.600e-09 66-80 


1392 


BL00018 


EF-hand calcium-binding domain proteins. 


BL000I8 7.41 6.870e-09 42-55 


1396 


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790B 2L59 1.000e-40 61- 
113 BL00790C 16,65 1.000e-40 
165-219 BL00790K9.30 l.OOOe- 
40 657-711 BL00790Q 15.61 
l.OOOc-40 855-904 BL00790O 
7.68 5.929c-39 797-830 
BL00790G 22.06 5.1 14e-36 376- 
420 BL00790R 16.20 7.469e-36 
951-995 BL00790E 29.58 7.250c- 
35 273-321 BL00790J 14.21 
8.200e-33 605-645 BL00790N 
13.25 L214e-3 1 763-790 
BL0O79OI 20.01 L93le-29 501- 
532 BL00790D 12.41 2„500e^27 
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243-268 BL0079QH 13.42 
6.478e-27 455-481 BL00790M 
8.74 8.683e-25 741-763 
BL00790P 12.33 3.755e-24 830- 
855 BL00790F 15.90 5.200e-24 
339-366 BL00790L 1 1.16 5.909e~ 
21 721-741 BL00790A 19.74 
1.964e-19 31-53 


1396 


BL00240 


Receptor tyrosine kinase class III proteins. 


BL00240F 17.74 9.500e^l6 789- 
837 BL00240E 11.56 i.439e-15 
736-774 BL00240G 28.45 
8.793e-l 5 836-889 


1396 


BL00107 


Protein kinases ATP -binding region proteins. 


BL00107A 18.39 3.647e~20 750- 
781 BL00107B 13,31 5.09 le- 1 3 
818-834 


1396 


PR00109 


TYROSINE KINASE CATALYTIC 
DOMAIN SIGNATURE 


PR00109D 17.04 9.IO0e-22 819- 
842 PR00109E 14.41 7,429e-19 
863-886 PR00109B 12.27 5.125e- 
18 750-769 PR00109A 15.00 
2.895e-13 713-727 PR00109C 
12.85 5.235e-l 2 800-811 


1396 


BL00239 


Receptor tyrosine kinase class 11 proteins. 


BL00239E 17.14 5.426e-27 790- 
840 BL00239B25.I5 3.000e-23 
684-732 BL00239F 28.15 8.132e- 
21 844-889 BL00239D 16.81 
2. 143e~J 0 762-788 BL00239C 
18.75 3.348e-l 0 737-760 


1396 


BL50001 


Src homology 2 (SH2) domain proteins 
profile. 


BL50001B 17*40 2.7 He- 11 747- 
768 BL50001D 11.00 7.300e-10 
818-829 BL50001C 10.17 l.OOOe- 
09 799-810 


1396 


PR00014 


FIBRONECTIN TYPE III REPEAT 
SIGNATURE 


PR00014C 15.44 8.07le-I3 490- 
509 PR00014B 14.77 3.400e- 10 
467-478 PR00014D 12.04 6.824e- 
10 508-523 PR00014A8.22 
3.455e-09 342-352 


1401 


BL00027 


'Homeobox' domain proteins. 


BL00027 26.43 2.000e-l 1 84-127 


1 Af\1 




Kinesin light chain repeat proteins. 


BL01 160B 19*54 5.958e-09 387- 
441 


1404 


BL01113 


Clq domain proteins. 


BL01113B 18.26 2.500e-l3 841- 
877 


14Uo 


oLUlZUo 


Amiloride-sensitive sodium channels 
proteins. 


BL01206D 30.58 3.025e-28 363- 
412 BL01206G 21.72 6.063e-27 
5J0-57G BLOlzUor 16.40 7.643e- 

1 Z AQS. CA/T UT A1 OA£t? in TO 

12 30 3 455e-I2 333-352 
BL01206B 13.56 1.205e-10 313- 
327 


1408 


BL01220 


Phosphatidylethanolamine-binding protein 
family proteins. 


BL01220B 16.65 1.000e-40 59- 
100 BL01220C 14.75 5.846e-34 
100-128 BL0I220A 22.62 
3.400e-31 21-52 


1409 


BL00815 


Alpha-isopropylmalate and homocitrate 
synthases proteins. 


BL00815C 21.36 3.1 lSe-09 786- 
815 


1412 


PR00049 


WILMS TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 4.051e-09 1-16 


1412 


PR00806 


VINCULIN SIGNATURE 


PR00806B 4.28 9.640e~09 3-17 


1418 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 5.200e-09 453- 
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__A . . 








400 


1418 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00QC8 16,07 7.882e-l4 524- 
541 B|00028 16.07 8.269e-l I 

BL00028 16.07 2.543e- 
09 43719-54 BL0002S 16.07 
4.600e-T)9 408-425 BL00028 
16.07 6.657e-09 465-482 


1418 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 3.160e-09 521- 
535 PR00048A 10.52 4.960e-09 
434-448 PR00048A 10.52 6.760c- 
09 552-566 PR00048A 10.52 
7.840e-09 462-476 


1419 


BL00022 


EGF-like domain proteins. 


BL00022A 7.48 5.000e-09 177- 
184 BL00022A 7.48 5.000e-O9 
241-248 BL00022A 7.48 S.OQOe- 
09 49-56 


1419 


PR00011 


TYPE III EGF-LIKE SIGNATURE 


PR00011D 14.03 5.696e-09 182- 
201 PR00011D 14.03 6.478e-09 
86-105 PR0001 ID 14.03 9.087e- 
09118-137 


14.1 0 


HN/frtl ft AO 
LUV1U1 o*t4. 


I CJiLkUlAJaJQ-iillNDlNlj DUMA IN, 

BACTERIAL TYPE. 


DM01842 11.31 9.922e-09 94-141 


1/111 

14/1 


r KUU J / 1 


*LAVOrR01JbiN PYRIDINE 

NUCLEOTIDE CYTOCHROME 
f? Pr^f iota, err cinMA Tr run 


PR00371D 14.55 4.536e-ll 385- 
405 


1421 


PR00406 


CYTOCHROME BS REDUCTASE 

olvjINAI UKJb 


PR00406D 10.02 6.538e-10 385- 

y* AC 

405 


1421 


PR00409 


PHTHALATE DIOXYGENASE 
REDUCTASE FAMILY SIGNATURE 


PR00409F 12.70 2.484e-09 385- 
405 


1421 


PR00466 


CYTOCHROME B-245 HEAVY CHAIN 

QTmsJATirPP 
oIvJlNrV 1 UlxJc. 


PR00466E 6.82 6.958e-17 386- 

a(\a Diinn/i/C/;/* 1 ia it o ^ a a ~ c\t\ 
4U4 rKUU40oL 10. 1 / 8.244e~09 

195-216 




DJLAJvl/iio 


j6iuc linger, uzjnui type, domain proteins. 


BL00028 16.07 4.462e-l 1 1087- 
1104 


1422 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDL 


PD00066 13.92 6.478e-ll 1075- 

1 ARB 


1422 


BL00319 


Amyloidogenic glycoprotein extracellular 
domain proteins. 


BL00319C 17.12 4.375e-10 1 154- 
1188 


1422 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 4.522e-10 1084- 
1098 PR00048B 6.02 L474e-09 
1072-1082 PR00048A 10.52 

O. /OUc-U:? l\JJO- l\J f\J 


1423 


PR00260 


BACTERIAL CHEMOTAXIS SENSORY 
TRANSDUCER SIGNATURE 


PR00260C 10.26 9.294e-09 L46- 
167 


1424 


BL00845 


CAP-Gly domain proteins. 


BL00845 16.43 6.442e-21 405- 
430 BL00845 16.43 9.820e-19 
901-99R 


1426 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 3.769e45 369- 
382 PD00066 13,92 4.462C-15 
285-298 PD00066 13.92 2.800e- 
14 257-270 W300066 13.92 
5.200M4 313-326 PD00066 
13.92 8.962c-10 341-354 


1426 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 5.050c-13 269- 
286 BL00028 16.07 5.050e-13 
297-314 BL00028 16.07 2.500e- 
10 325-342 BL00028 16.07 
5.200e- 1 0 353-370 BL00028 
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™ 1 

C2HMYFE ZINc|iNGER SIGNATURE 


16.07 7.000e~10 241-258 
BL00O28 16.07 9/700e-10 381- 
398 


1426 


PR00048 




PR00048A 10.52 8.5O0e-17 266- 
280 PR00048A 10.52 5.500e- 14 
294-308 PR00048A 10.52 4.706e- 
12 350-364 PR00048B 6.02 
6.000e- 1 2 3 1 0-3 20 PR00048B 
6.02 6.538e-H 394-404 
PR00048A 10.52 2*565e- 1 0238- 
252 PR00048B6.02 2.688e*10 
254-264 PR00048B 6.02 4.375e- 
10 338-348 PR00048A 10.52 
5.3 04e- 10 378-392 PR00048A 
10,52 9.609e~10 322-336 
PR00048B 6.02 5.263e-09 282- 
292 PR00048B 6 02 6.21 le-09 

17£ 


1429 


PR0067I 


INHBIN BETA B CHAIN SIGNATURE 


PR00671C 4.18 5.345e-09 9-29 


l *f j i 


U1V1U1 oUj 


1 T.TI7D "DT2C1/TD T TO fiT V/"* fl"D D/^TC TXT Tf 


niiilAI DAI A 1 /I f 1 £. Off. AA )%i '# 

PMO1803A 10.51 6.855e-09 215- 
236 


1431 


BL00226 


Intermediate filaments proteins. 


BL00226D 19.10 7.400e-09 390- 

All 

43/ 


1432 


DM01803 


1 HERPESVIRUS GLYCOPROTEIN H. 


DM01803A 10.51 6.855e-09 251- 
272 


1432 


BL00226 


Intermediate filaments proteins. 


BL00226D 19.10 7.4O0e-O9 426- 
473 




r JKUUjh-j 


Ivc I IINUJC ACID KiiCbl 1 UK 
SIGNATURE 


PR00545A 5.35 9,430e-09 383- 
398 




tJLAJ lZ5o 


viu A i/cuiy ramtly or nucleoside 
phosphatases proteins. 


BL01238A 1 1.72 7. 840e- 16 76-91 


1437 
l^J f 




IIYW I JLjIIN VJlrAac lJV/lVI AI1N 

ACTIVATION. 


rDtJuyJUo 55.11 z.K0Ue-2o 1256- 
1297 PD00930A 25.62 3.864e-13 

1 1 1 1 "70 


1437 


PF00620 


GTPase-activator protein for Rho-like 
GTPases. 


PF00620B 14.20 7.000e-l2 1205- 
1222 


1437 


PR00683 


SPECTRIN PLECKSTRIN HOMOLOGY 
DOMAIN SIGNATURE 


PR00683B 16.62 2.603e- 10 946= 
968 PR00683D 15.87 2.773e-09 
1005-1024 


1437 


PR00543 


OESTROGEN RECEPTOR SIGNATURE 


PR00543H 10.86 7.573e-09 556- 
576 


14.17 




jTJLAt* uuiuain proiGins ^/\iso Known aS UrXix 
or GLGF). 




1437 


BL00275 


Shiga/ricin ribosomal inactivating toxins 
proteins signatu- 


BL00275A 12.16 7.677e-09 1226- 
1240 


1441 


BL00223 


Annexins repeat proteins domain proteins. 


BL00223B 28.47 I.000e-40 140- 

iy\J i5LAt\j£.Z.5\^ £S\.I7 

217-272 BL00223A 15.59 
5.500e^32 21-55 BL00223A 
15.59 4.783e-l4 230-264 
BL00223C 24.79 2.515e-10 8-63 
BL00223A 15.59 6,250e-10 71- 
105 


1441 


PR00199 


ANNEXIN TYPE ni SIGNATURE 


PR00199G 9.09 8,364e-21 239- 
265 PR00I99F 16.19 5.636e- 16 
158-185 PR00199D5.65 5.375e- 
14 25-47 PR00199B 6.86 1.574e- 
13 30-53 PR00199D 5.65 7.987e- 
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SEQ ID 
NO: 


Database 
entry ID 


Description 


Results* 






»- — — 

I 


13 234-256 PR00199H 12.62 
5.339e- 12 282-296 PR00199D 
5.65 9.276e-10 75-97 


1441 




/\1NINJgAI1nH lire IV MvJInAI UKb 


FKU0200F 13*72 l.lI8e-35 158- 
185 PR00200G 9.43 1.000e-34 
238-265 PR00200B 7.39 1.643e- 
29 30-53 PR00200H 13.68 
1 .766e-l 8 282-296 PR00200E 
10.00 6.16fe46 75-97 PR00200E 
10.00 2. 1 1 le-14 25-47 PR00200A 
4.93 2,I25e-14 5-16 PR00200C 
8.76 l.500e-12 54-63 PR00200E 
10.00 2.859e-ll 234-256 
PR00200G9.43 5.294e-ll 29-56 
PR00200D 10.01 9,722e-10 70^87 


1441 


PR00197 


ANNEXIN TYPE I SIGNATURE 


PR00197F9.03 5.250e-16 238- 
259 PR00197D 7.50 1.250e-15 
25-47 PR00197E 11.89 8.463e-14 
158-185 PR00197D 7.50 L542e- 
12 234-256 PR00197D 7.50 
5.451e-10 75-97 PR00I97B 7.56 
.2,206*49 3,0-53 


1441 


PR00198 


ANNEXIN TYPE II SIGNATURE 


PR00198G 8.09 7.943e-16 238- 
259 PR00198D7.65 2.271e-13 
234-256 PR00198D 7.65 9.894e- 
13 25-47 PR00198E 14.67 
6.361e-ll 158-185 PR00198H 
12.05 t.462e~l0 282-296 
PK0UI98B 8.71 9.357e-10 30-53 
PR00198D 7.65 4.845e49 75-97 


1441 


PR0020I 


ANNEXIN TYPE V SIGNATURE 


PR00201G 11.02 9.419c-26 238- 
265 PR00201A6.05 4.770e-16 
30-53 PR00201E 12.37 4J03e-15 
158-185 PR00201H 12.04 4.375e* 
14 282-296 PR00201D 10.49 
4.150e-10 75~97 PR00201G 11.02 
8.402c-l 0 29-56 PR00201D 10.49 
6.1796-0925=47 


1441 


PR00301 


70 KD HEAT SHOCK PROTEIN 
SIGNATURE 


PR00301D 15.51 7.395e-09 38-59 


1441 


PR00196 


ANNEXIN FAMILY SIGNATURE 


PRO0196D 21,86 3,032e~24 158- 

153 rKUUiyoc y, iy o.333e-23 
238-259 PR00196A 11.16 9.100e- 
21 30-53 PR00196F 13.89 
2.714e-15 266-282 PR00196C 
10.36 5.167e-15 25-47 PR00196G 
1 1.72 3.000e44 282-296 
PR00196C 10.36 7.344e-13 234- 
256 PR00196C 10.36 1.703c- 12 
75-97 PR00196G 11.72 9.21 7e-10 
207-221 PR00196F 13.89 4.1 88e- 
09 107-123 PR00196A 11.16 
7.840e49 80-103 


1441 


PR00202 


ANNEXIN TYPE VI SIGNATURE 


PR00202G 8.01 4.833e-28 238- 
265 PR00202E 13.00 4.643e-16 
158-185 PR00202D5.58 9.604e- 
13 75-97 PR00202B 11.44 
2.763e-l 1 29-53 PR00202H9.20 
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NO; 


I ■ '1 4 .J l^naA 

JL/dLHUHSc 

entry ID 


Description 


Results* 








4.740e-l I 282-296 PR00202D 
5.58 1.908e~09 25-47 PR00202G 
8.01 9.237e-09 29-56 


1444 


DM01513 


CAMP-DEPENDENT PROTEIN KINASE 

ft 13 fit If A TOT? V r'M A TTv! 


DM01513A 13.61 8.568e-14 15- 
56 






Thymidine kinase cellular-type proteins. 


BL00603C 30.02 1.0O0e-4O 152- 
207 BL00603A 20,71 4.500e-33 
63-96 BL00603D 10.53 5.091c- 
IS 217-232 BL00603B 11.39 
3.455c-15 132-147 


1446 


PD01922 


PROTEIN PHOSPHODIESTERASE 
HYDROJL 


PD01922B 21.83 7,328e-14 162- 
198 


1447 


BL00061 


Short-chain dehydrogenases/reductases 
family proteins. 


BL00061B 25.79 1.931e-13 99- 
137 


1448 


BL01160 


Kinesin light chain repeat proteins. 


BL01 160B 19.54 5.958e-09 64- 
118 


1449 


PF00856 


SET domain proteins. 


PF00856A 26.14 8,579e-l 1 5-42 


1 AAQ 


rrLKJOZo 


PHD-fmger. 


PF00628 15.84 5.500e-10 1 N26 


1452 


BL00030 


Eukaryotic RNA-binding region RNP-1 
proteins. 


BL00030B 7.03 3.400e-10 116- 
126 


1454 


PF00075 


RNase H. 


PF00075D 10.71 7.000e-ll 517- 
528 PF00075C 11,58 9.786e-ll 
484-496 PF00075B 12.56 4.073e- 
10 449-460 PF00075A 14.44 
2.143e-09 402-419 


1454 


PR00211 


GLUTELIN SIGNATURE 


PR00211B0.86 4.417e^09 138- 
159 


1456 


BL00262 


Insulin family proteins. 


BL00262B 16.89 8.286e-17 68-88 
BL00262A 12.48 4.600e-15 32-50 


1456 


PR00277 


INSULIN B CHAIN SIGNATURE 


PR00277A 14.82 2.421e-13 29-43 
PR00277B 12.79 2.350e-ll 43-56 


1456 


PR00276 


INSULIN A CHAIN SIGNATURE 


PR00276A 1 1.84 4.750e-13 69-79 
PR00276B 8,02 7,828e-10 78-88 


1457 


PR00213 


MYELIN P0 PROTEIN SIGNATURE 


PR00213E 5.51 9-775e-12 264- 
289 


1459 


BLO0856 


Guanylate kinase proteins. 


BL00856C 29.21 2.658e-26 539- 
587 BL00856B 9.61 2.946e-18 
511-532 


1459 


PR00452 


SH3 DOMAIN SIGNATURE 


PR00452B 1L65 2.750e-09 369- 
385 


1459 


PD00289 


PROTEIN SH3 DOMAIN REPEAT 
PRESYNA. 


PD00289 9.97 6.586e-09 298-312 






PDZ domain proteins (Also known as DHR 
or GLGF). 


PF00595 13.40 8.800e-09 295-306 


1461 


PR00475 


HEXOKINASE FAMILY SIGNATURE 


PR00475B 14.92 6.143e-26 186- 
L\l rK00475L 10. Uo 2.742e~22 
327-350 PR00475F 9.68 4.000e- 
20 407-430 PR00475A 14.06 
3.1 1 8e-19 118-135 PR00475C 
11.92 6.684e-l 9 239-256 
PRO0475G9.08 L692e-16 479- 
496 PR00475D 13.30 2.653e- 13 
262-277 PR00475G 9.08 2.650e- 
10 32-49 


1461 


BL00378 


Hexokinases proteins. 


BL00378C 16.14 1.000e-40 243- 
287 BL00378E 22.92 5.821e-40 
313-359 BL00378B 14.23 3.647e- 
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NO: 


entry ID 


Description 


Results* 








io GU ii< Til nnnoc o T7 

2.688e- 17 481-496 BL00378D » 
10.94 1.474e- 13 291-303 | 
BL00378A 19.01 8.694e-li 59-8W 


1464 


PR0G722 


CHYMOTRYPSIN SERINE PROTEASE 


PR00722A 12,27 8.448e~14 56-72 


1464 


BL00021 


Kiingle domain proteins. 


BL00021B 13.33 1.763e-13 55-73 


1 A&A 

I t t\J t T 




ViS bJbJKJINJb JrKJJl JbAMi rAMlLi 

SIGNATURE 


rKU0839B 1 1.20 4.945e-09 55-73 


i40<r 




Type I fibronectin domain proteins. 


BL01253E 16.01 6.381e-09 125- 
162 


1/1 Ad. 




Serine proteases, trypsin family, histidine 
proteins. 


BL00134A 11.96 3.813e-15 55-72 
BL00134B 15.99 7.200e-10 1 86- 
210 BL00134C 13.45 9.206e-O9 
219-233 


1466 


BL00291 


Prion protein. 


BL00291A4.49 9.379e-09 105- 
140 


1467 


PF00534 


Glycosyl transferases group I. 


PF00534B 14,47 9.5Sle^I2 398- 
422 


1468 


PF0H05 


emp24/gp25L/p24 family. 


PF01105B 25.12 2.868e-25 126- 
178 


1469 


PF0H05 


emp24/gp25L/p24 family. 


PF01l05B25J2 2,868e-25 151- 
203 


1470 


PR00305 


14-3-3 PROTEIN ZETA SIGNATURE 


PR00305A9.33 9.500e-36 37-67 
PR00305E 13.01 4.316e-32 177- 
204 PR00305D 16.34 3.647e-30 
150-177 PR00305F 15.95 1.964e- 
26 204-234 PR00305C 8.68 
3,182e-26 115-138 PR00305B 
9.99 4.857e^24 84-109 PR00305F 
15.95 8.975e-15 215-245 


1470 


BL00796 


14-3-3 proteins* 


BL00796C 17.44 1.000e-40 99- 
149 BL00796D 17.39 1.000e-40 
150-196 BL00796B 10.67 7.000e- 
3y3/"/iJ BLOO/yoh 14,15 
3.045e-33 198=234 BL00796A 

14.15 2.742e-11209-245 


1474 


PF00642 


slm ilar). 




1475 


PF00588 


SpoU rRNA Methylase family. 


PF00588B 17.18 8.200e-10 28U 


1476 


BLOU60 


Kinesin light chain repeat proteins* 


BL01 160B 19.54 5.653e-09 791- 
845 


1477 


PR00326 


GTP1/OBG GTP-BINDING PROTEIN 
FAMILY SIGNATURE 


PR00326A 8.75 4.255e-14 364- 
385 


1477 


BL00306 


Caseins alpha/beta proteins. 


BL00306B 8.28 L900e-09 557- 
568 


1477 


PR00318 


ALPHA G-PROTEIN (TRANSDUCIN) 
SIGNATURE 


PR00318E 7.23 5.320e-09 220- 
230 


1479 


BL01305 


moaA / nifB / pqqE family proteins. 


BL01305D 14.97 7.279e-09 7-22 


1480 


PR00918 


CALICIVJJRUS NON-STRUCTURAL 
POLYPROTEIN FAMILY SIGNATURE 


PR00918A 13.76 5.807e«09 458- 
479 


1480 


PR00674 


LIGHT HARVESTING PROTEIN B CHAIN 
SIGNATURE 


PR00674A 20.10 9.870e~09 133- 
154 


1481 


PR00171 


SUGAR TRANSPORTER SIGNATURE 


PR00171E 14.87 l.000e-08 73-86 | 
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SEQ ID 
NO: 


Database 
entry ID 


Dfjprinti ah 


KvSlllIS 


1482 


DM01418 


352 FIBRILLAR COLLAGEN 
CARBOXYL-TERMINAL. 


DM014I8A 20.83 5,650e-23 101- 
149 DM01418B 22.51 8.500e-ll 


1482 


BL00291 


Prion protein. 


BL0029M 4.49 9,349e-10 23-58 


1482 


BL01113 


Clq domain proteins. 


BL0111& 17.99 6.1 14e-l 138-65 
BL0I113A 17.99 2.9 15e- 10 35-62 
BL0H13A 17.99 6.538c-09 17-44 

TIT A 1 1 1 1 A 17 OO £ 7 1 On AO OO C£ 

oLuii ij/\ 1 f t yy o.44ze-uy Jz-jy 


1483 


DM01418 


352 FIBRILLAR COLLAGEN 


DM01418A 20,83 5.650e-23 117- 

IOj 1/lVlU l *t 1 olJ ZZ.DI o.JVUe-l 1 

182-224 DM01418C 20.48 
8 fi^p-l ft 757-704 


1483 


BL00291 


Prion protein. 


BL00291A 4.49 9.349e-10 23-58 


1483 


BLOI 113 




Rlfitli^A ivoo^ii/i/* ii ao /:« 
0L1V 1 1 1 j/v 1 / .yy o.ii*fC~lJl jo-oj 

BL0I113A 17.99 2.9I5e-10 35-62 

BL01113A 17.99 6.538e-09 17^44 

BL01113A 17.99 6.712e-09 29-56 

TK\ ftlt 13 A 1 7 00 R d49f.no 37 *0 


1484 


BL01052 


Calponin family repeat proteins. 


BL01052B 15.31 3.308e-l 1 30-56 


1484 


PR00888 


SMOOTH MUSCLE PROTEIN/CALPONIN 
FAMILY SIGNATURE 


PR00888C 12.27 2.141e-09 30-46 


1486 


BL00795 


Involucrin proteins. 


BL00795C 17.06 7.600e-09 239- 
284 




DL*VU*flJ 


Synapsins proteins. 


Or Artjl 1 /TV T j| Of! C\ Af\f\ Af\ Ol 1) 

BL00415N 4.29 9.409e-09 81 8- 
862 






ATP-de pendent serine proteases, Ion family, 
serine active sit. 


BL01046D 19.61 4.938e-35 452- 
493 BL01046C 17.03 9.581e-31 

29 331-377 


1490 


PR00830 


FNnOPFPTinA<5P T A (1 fYNH SFRITMP 
PROTEASE (SI 6) SIGNATURE 


punnjiiftn « rift 7 ^o*. 7 a 7^7 
787 PR00830A 8.41 7.545e-18 

37^»3Q*i PROflJnflF 1 3 04 R SOfte 

15 790-809 PR00830C8.47 
2 837e~13 737-757 PR00830R 
14.73 7.429e-13 654-671 


1490 


BL01128 


Shikimate kinase proteins. 


BL01 128 A 18 84 8 027e-12 371- 
405 


1490 


PR00300 


ATP -DEPENDENT CLP PROTEASE ATP- 
BINDING SUBUNIT SIGNATURE 


PR00300A9.56 L254e-10 37L 
390 


1490 


PR00819 


CBXX/CFQX SUPERFAMILY 
SIGNATURE 


PR00819B 10.83 2.350e-I0 370- 
386 


1490 


BL00674 


AAA=protein family proteins. 


BL00674B 4.46 8.071e^l0 368- 
390 


1490 


PR00364 


DISEASE RESISTANCE PROTEIN 
SIGNATURE 


PR00364A 8.19 1.818e-09 370- 

386 


1490 


BL00113 


Adenylate kinase proteins. 


BL00113A 12.74 7.369e-09 3 72- 
389 


1491 


BL00824 


Elongation factor 1 beta/betaVdelta chain 
proteins. 


BL00824B 9.21 2.338e-09 150- 
170 * 


1495 


BL00615 


C-type lectin domain proteins. 


BL00615A 16.68 3.880e-1 1 47-65 
BL00615B 12.25 2.286e~t0 149- 
163 


1498 


PR00119 


P-TYPE CATION-TRANSPORTING 
ATPASE SUPERFAMILY SIGNATURE 


PR00119B 13.94 8.7I4e^l2 35-50 
PR00119E 8.48 7.716e-ll 420- 
440 
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entry ID 
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Results* 


149R 

1 TyO 




J-M-TR AN^PORTTMfi ATPATF (PRDTfYW 

PUMP) SIGNATURE * 


PRfYl f 1<\C O Qfl 7 (117^ t (\ &*}t\ 
rl\\J\JlZ,\JK* y.yV f .\Jj f C" * \j U— 

437 






E*i"Ljj£, All clijt/D JJilL/bpilUl jiaLIwll hlvS prOtClno. 

1 

D 


ri no i ^zin? on 17 ^ oi^* i o o^i 
ulaju i j*f ^u* j / J./,/ De- 1 y ZOJ- 

304 BL00154F 8.23 6.1 75e-19 

/l 17 441 RT fiftl 'lir' H QQ /I *i7£o 

13 31-50 BL00154D 12.57 


1499 


BL00039 


OPAF)^hnv subfamily ATPwflttnPnripnt 

helicascs proteins. 


522 BL00039A 1 8.44 8.548e-23 
145-184 BL00039C 15,63 8.500e- 
16277-301 BL00039B 19,19 

1 JH7(»-17 1Q1-717 


1499 


DM01537 


kw SK12W SK12 NUCLEOLAR HELICASE- 


DM01537B 2L63 8.990e-12 450- 
497 


1499 


PF0O27I 


Helicases conserved C-terminal domain 

proteins. 


PF00271 7.99 5.500e-10 507-515 


1501 


BL00115 


Eukaryotic RNA polymerase 11 heptapeptide 
repeat proteins. 


BL001 15Z 3.12 9.669e~09 116- 
165 


1502 


PF00168 


C2 domain proteins. 


PF00168B 1 1.83 8.000e-10 38-49 


1502 


PRO0360 


C2 DOMAIN SIGNATURE 


PR00360A 14.59 6.806c-10 43-56 
PR00360B 13.61 2.227e-09 67-81 
PR00360B 13.61 5.909e-09 223- 
237 


1503 


PF00791 


Domain present in ZO~l and Unc5-Iike netrin 
recepLors. 


PF00791B 28.49 4.165e-13 780- 
835 PF00791B28.49 6.767e-10 
888-943 PF00791C 20.98 8.059e- 
09 794-833 


1504 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 5.875e-10 437- 
453 PF00023A 16.03 7.000e-10 
563-579 PF00023A 16.03 8.500e- 
10 248-264 PF00023A16.03 
9.250e-10 95-Ill PF00023A 
16.03 3.250e-09 596-612 
PF00023A 16,03 3.893e-09 716-* 
732 PF00023A 16.03 6.786e-09 
62-78 PFO0023A 16.03 9.036e-09 
496-512 


1504 


PD00078 


REPEAT PROTEIN ANK NUCLEAR 
ANKYR. 


PD00078B 13.14 2.957e-09 88- 
101 PD00078B 13.14 5.696e-09 
556-569 PD00078B 13.14 9.217e- 
09 742-755 


1504 


PF0079I 


Domain present in ZO-1 and Unc5~liko netrin 
receptors. 


PF00791B 28,49 5.024e~15 215- 
270 PF00791B28.49 7.750e-!4 
62-117 PF00791B 28.49 9.795e- 
14 530-585 PF00791B 28.49 
9.505e-13 683-738 PF00791B 

PF00791B 28.49 2.636*1 1 716- 
771 PF0079IC 20.98 5.696z~ 11 
697-736 PF00791B 28.49 3359e- 
10 404-459 PF00791B 28.49 
5.369e- 10 248-303 PF00791B 
28.49 6767e-l 0 563-618 
PF00791C 20.98 8.052e-10 544- 
583 PF00791C 20.98 3.382e^09 
229-268 PF0079IB 28.49 7.275e~ 
09 37L-426 PF00791C 20.98 
9.912e-09 385-424 
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Results* 


1505 


BL00479* 


Phorbol esters / diacylglycerol binding 
domain proteins. 


BL00479B 12.57 8.71 4e^0? 143- 
159 


1506 


BL00479| 


Phorbol esters / diacylglycerol binding 
domain proteins. 


BL00479B 12.57 8.714e-09 167- 
183 


1507 


BL00518* 


Zinc finger, C3HC4 type (RING finger), 
proteins. 


BLG0518 12.23 5.286e-10 239- 
248 


1507 


BL01282 


BlR repeat proteins. 


RT 019R7R ^fl AQ 1 Ohfioi f\Q ion 

259 


1507 


BL00264 


pjftiirnnvnnnn v^ial hAFttifttif**; nrnt/»inc 

i^ioui iruy y\j\jny&wii lIVllilUllGo piUlulIls*. 


■RT (VYJAA ft QQ A QQAa fid 1A 1 i£0 

BhV\)ZO*f o.yo 4.oo4e-U9 34 1-368 


1510 


BL00122 


Carboxylesterases type-B serine proteins. 


BL00122G 1 1.67 2.500e-l 5 1 5-26 


1511 




T UTFOVIRTTS ORFfi PROTPIM 
SIGNATURE 


rKuuyiyA z.m i.ysoe-l 1 340- 
353 PR00910A2.51 !.986e-Il 
342-355 PR00910A2.51 t.986e~ 

1 1 ^*KH^D / rKUUVl OA Z.5 1 
9.778e^0 346-359 PR00910A 
2.51 K107O-09 338-351 
PR00910A2.51 3.464e-09 336- 
349 


1511 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 4.508e-09 324- 
357 


1512 


DM01970 


0 kwZK632.12 YDR313C ENDOSOMAL 
HI. 


DM01970B 8.60 8.475e-15 175- 
188 


1514 


PRO0833 


POLLEN ALLERGEN POA PI 
SIGNATURE 


PR00833H2.30 8.375e-l0 149- 
164 PR00833H2.302.846e-09 
147-162 


1514 


PR00308 


TYPE I ANTIFREEZE PROTEIN 
SIGNATURE 


PR00308A 5.90 9.630e-l 1 1 50- 
165 PR00308C 3.83 8.892e-10 
104-114 PR00308C 3.83 8.892e- 
10 105-115 PR00308C3.83 
8.892e-10 151-161 PR00308C 
3.83 8.892e-10 152-162 
PR00308C3.83 8.892e~10 153- 
loj FKOOjOoC 3.83 8.892e-10 
154464 PR00308C 3.83 7,545e- 

7.896e-09 150-160 PR00308B 

4.28 8.397e-09 150-162 

PR 00308 A 5 QA 0 A47*»-AQ 1 At 

i SWI\JD\JQr\ J.JU Jf.Vt /U Uy 11/ I** 

116 


1514 


PR00456 


RIBOSOMAL PROTEIN P2 SIGNATURE 


PR00456E 3.06 7.188e-l0 144- 
159 PR00456E 3 06 I 684e~09 
145-160 PR00456E 3.06 7.949e- 
09 97-112 PR00456E3.06 
9.430e-09 98-113 


1515 


PF00992 


Troponin. 


PF00992A 16.67 3.368e-09 448- 
483 


1521 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019A 11.19 7.333e-09 322- 
336 PR00019B 1 1.36 9.280e-09 
319-333 


1522 


BL00315 


Dehydrins proteins* 


BL00315A 9.35 7.197C-I0 93-121 


1524 


PD00930 


8ROTEIN GTPASE DOMAIN 
ACTIVATION. 


PD0093OB 33.72 4.240e~16 235- 
276 


1524 


PR00234 


HIV- 1 MATRIX PROTEIN SIGNATURE 


PR00234E 11.78 7.268e-09 361- 
375 


1525 


BL00183 


Ubiquitin-conjugating enzymes proteins. 


BL00183 28.97 8.338e-14 44-92 


1527 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019B 11.36 6.850e-10 132- 
146 PR00019A 11.19 2.667e-09 
135-149 PR00019B U,36 9.640e- 
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Results* 








09 180-194 PR00019B 11.36 
L000e-08 277-291 


1529 


BL00625 


Regulator of chromosome condensation 
(RCC1) proteins. 


BL00625A 16,21 2,033e-16 567- 
596 BL00625B 17.69 4.205e- 12 
561-595 BL00625B 17,69 9.423e- 
11 93-127 BL00625B 17.69 
1.444C-1 0 152-186 BL00625A 
16.21 1.759e-10 99-128 
BL00625A 16.21 2.739e-09 515- 
544 BL00625B 17.69 3.1 72e-09 
43-77 BL00625A 16,21 4.170e- 
09 158-187 


1529 


PR00633 


CHROMOSOME CONDENSATION 
REGULATOR RCC1 SIGNATURE 


PR00633B 13.02 3.535e-09 561- 
575 PR00633A 9.32 6.260e-09 
527-544 PR00633F 10.03 7.949e- 
09 528-543 


1530 


BL00414 


Profilin proteins. 


BL00414A 13.85 6.344e-13 2-16 
BL00414E 15.46 6.283e-09 121- 
136 


1530 


PR00392 


PROFIUN SIGNATURE 


PR00392C 18.98 2.350e-12 41-62 
PR00392A 14.10 4.176e-12 4-14 
PR00392D 12.00 5.250e-ll 63-78 
PR00392F 17.40 7.955e-ll 122- 
140 PR00392E 12.06 6.500e-09 
109-123 


1531 


BL0041 4 


Profilin proteins. 


BL00414A 13.85 6.344e-13 2-16 
BL00414E 15.46 6,283e-09 105- 
120 


1531 


PR00392 


PROFILIN SIGNATURE 


PR00392C 18.98 2.350e-12 41-62 
PR00392A 14.10 4.l76e-12 4-14 
PR00392D 12.00 5.250e-ll 63-78 
PR00392F 17.40 7.955e-Il 106- 
124 PR00392E 12.06 8.833e~09 
93-107 


1532 


PD00301 


PROTEIN REPEAT MUSCLE CALCIUM- 

131. 


PD00301A 10.24 8.200e-09 131- 

142 


1533 


DM01930 


2 kw FINGER SMCX SMC Y YDR096W. 


DM01930F 14.16 1.310e-27 24-60 


1534 


BL00411 


Kinesin motor domain proteins. 


BL00411G 21.39 2.200e-39 77^ 
119 BL00411H 15.66 8*800e-33 
125-156 BL0041LF 14.77 6.250e- 
Jo 


1534 


PR00380 


KINES1N HEAVY CHAIN SIGNATURE 


PR00380D 9.93 7.923e-26 126- 
14o rKOOioOC 13.18 1.0U0e-2I 
76-95 PR00380B 12.64 1.621e^l6 


1534 


BL00893 


mutT domain proteins- 


BL00893 18.99 8.826e-09 176- 
201 


1536 


BL00600 


Aminotransferases class-Ill pyridoxal- 
phosphate attachment si. 


BL00600E 16.43 5.725e-15 164- 
193 BL00600G 12,43 7,000e-I4 
242-261 BL00600F 8.77 7.480e- 
11 207-220 BL00600D8.71 
1.750C-1O 143-157 


1537 


BLOU838 


Interleukins -4 and -13 proteins. 


BL00838A 12.35 8.696e-09 136- 
155 


1537 


PD01847 


PHOTOSYSTEM II PROTEIN REACTION 
CENTRE I TRANSM. 


PD01847 9.59 8.946e-09 137-173 


1539 


PR00121 


SODIUM/POTASSIUM-TRANSPORTING 
ATPASE SIGNATURE 


PR00121D 16.72 3.012e-I2 261- 
283 
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NO: 


entry ID 


Description 


Results * 


1539 


RI ,001 


jm-gz ri l rdticii pnospnoryiauun site proteins. 


fc>L.UUtj4fc zU.i/ o.4ooe-lo 
573 BL00154C 12.38 3,520e|12 
ztw-zoi & 


1539 


BL0I228 


Hypothetical cof family proteins. 


BL01228D 17.44 6.400e~I 1 6§)- 

roc w 

685 J 


1539 


PR00119 


P-TYPF, CATION-TRANSPORTING 

ATPAQF QTlPPttft AlVrTf V QJlTMATT TRT? 
r\ xl ftoH OUrCIvT/VlVllLi I OlAJiN/\l UtvJUi 


PR00119B 13.94 3.333e-ll 268^ 

OQ1 D'DAfHlfllTiO C£ £ A£i*. in 

zoi rKUUl lyu y.jo o.063e-10 
548-559 


1540 


BL0Q289 


Pentaxiii family proteins. 


BL00289A 30.36 9.031e-09 331- 
362 


1j4z 




Protei n-L-i soaspartate(D-aspartate) 0- 
methyltransferase signa. 


BL01279A 24.27 l.OOOe-lt 67- 
115 




JdL00422 


Granins proteins. 


BL00422C 16.18 7.1 76e-09 303- 
331 


1545 


BL00027 


'Homeobox* domain proteins. 


BL00027 26.43 4.462e-32 244^ 
287 


1545 


PR00025 


HOMEOTIC ANTENNAPEDIA PROTEIN 
SIGNATURE 


PR00025B 11.94 3.143e-12 230- 
246 


1545 


PR00024 


HOMEOBOX SIGNATURE 


PR00024C 7.49 3.500es-12 276- 
286 PR00024A11.87 7.000e-I2 
251-263 PR00024B 11.27 1.409e- 
10 266-277 


1545 


PR00031 


LAMBDA AND OTHER REPRESSOR 
HELIX-TURN-HELIX SIGNATURE 


PR00031B 16.29 4.4l4e~l0 267- 
284 


1545 


BL00032 


'Homeobox' antennapedia-type protein. 


BL00032B 10.83 1.675e-37 233- 
272 BL00032C 11.28 4.429e-21 
272-290 BL00032A 18.38 
5/750e-10 193-216 


1546 


PD00066 


PROTEIN ZINC-FINGER METAL-BlNDi. 


PD00066 1 3,92 6.400e- 1 6 245- 
258 PD00066 13,92 8.615e-15 
329-342 PD0Q066 13.92 6.000e- 
13 301-314 PD00066 13.92 
4.857e-12 217-230 PD00066 
13.92 L346e-10 273-286 
PD00066 13.92 8,200e-09 357- 
370 


1546 


BL00028 


Zinc finger* C2H2 type, domain proteins. 


BL00028 16.07 5.950e-13 313- 
330 BL00028 16.07 7.261 e- 12 
229-246 BL00028 16.07 3.077e- 
11 16-33 BL00028 16.07 3.769e- 
11 285-302 BL00028 16.07 
9.308M 1 341-358 BL00028 
16.07 3. 100e-10 397-414 
BL0002S 16.07 5.800e-10 201- 
218 BL00028 16.07 6.400e- 10 
369-386 BL00028 16.07 7.600e- 

BLUUUzo 16.07 
8.800e-10 72-89 BL00028 16.07 
9,229^09 101-118 


1546 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 3,1 18e-I2 310- 
324 PR00048B 6.02 5.000e-l 2 * 
326-336 PR00048A 10,52 6.294e- 
12 13-27 PR00048D 6.02 L692e- 
11 242-252 PR00048A 10,52 
3.842e-ll 338-352 PR00048A 
10.52 5.263e-li 366-380 
PR00048A 10.52 8.579e-ll 226^ 
240 PR00048A 10.52 8.579e=ll 
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NO: 


Database 
eiitryJD 


Description 


Results * 






— — — ■ — r 

I 


254-268 PR00048A 10.52 3 348e- 
10 394-408 PR00048A 10.52 
4.91 3e-l 0 282-296 PR00048B 
6.02 7.188e-10 298-308 
PR00048B 6.02 9.053e-09 57-67 
PRO0048A 10.52 9.640e~09 98- 
112 


1547 


BL00585 


Ribosomal protein S5 proteins. 


BL00585B 18,78 6.143e-18 303- 
340 BL00585A28.43 4.286e-l6 
220-272 


1548 


PR00482 


OMPTIN SERINE PROTEASE 
SIGNATURE 


PR00482C 1 1.02 7.968es09 816- 
842 


1549 


PR00500 


POLYCYSTIC KIDNEY DISEASE 
PROTEIN SIGNATURE 


PR00500B 7.74 7,359e-l0 56-77 


1551 


PR00917 


SMALL ROUND STRUCTURED VIRUS 
(C37) CYSTEINE PROTEASE FAMILY 
SIGNATURE 


PR00917G 10.59 8.990e~09 812- 
830 


1553 


BL00028 


Zinc f\n&£t C2H7 tvne da ma in nrotefn«: 


126 


1555 


PF00638 


RanBPl domain proteins. 


PF00638 11.91 4.600e-18 67-82 


1555 


DM01269 


303 kw ACTIVATING RAN GTPASE 

T^OyYMF 

X&KyJLi X IVLL/. 


DM01269A 23.35 l.600e»2068- 
138-148 


1556 


BL00406 


Actins proteins. 


BL00406E 8.44 8.541e-28 323- 
373 BL00406B 5.47 1.375e-27 
82-137 BL00406D 12.58 3.160e- 
26 266-321 BL00406C 6.75 
6 943e-25 141-196 RI 0H406A 
9.95 2.575e-20 7-42 


1556 


PR00190 


ACTIN SIGNATURE 


PR00190F 7 80 3 647e-13 139* 
\ 59 PR001 90C 1 1 .49 2.029e-l 2 
60-83 PR00I90G 12.62 2.050e-09 
233-250 


1558 


BL00048 


Protamine PI proteins. 


BL00048 6.39 3.700e-09 153 A 80 


1558 


DM01206 


CORONAVIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 9.328e«ll 157- 
177 DM01206B 10.69 1.247e-10 
236-256 DM0I206B 10.69 
7.781e-10 188-208 DM01206B 
10.69 6.582e-09 234-254 


1559 


PR00315 


GTP-BIND1NG ELONGATION FACTOR 
SIGNATURE 


PR00315A 1 1.81 5.688e^l0 126- 
140 


1559 


PR00326 


GTP1/OBG GTP-BINDING PROTEIN 
FAMILY SIGNATURE 


PR00326A 8.75 1.000e-09 127- 
148 


1559 


PR00449 


TRANSFORMING PROTEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 6.43 le-09 125- 
147 


1559 


PR00755 


AFLATOX[N BIOSYNTHESIS 
REGULATORY PROTEIN SIGNATURE 


PR00755F 10.99 9.722e-09 30-52 


1563 


PF00780 


Domain found in NIKI-like kinases, mouse 
citron and yeast ROM. 


PF00780B 23.03 9.908e^09 14-57 


1567 


BL00162 


Eukaryotic-type carbonic anhydrases proteins. 


BL00162C 17.78 1.000e-40 88- 
12S BL00162E 14.93 7.23!e-39 
171-204 BL00162F 22.68 5,050e- 
31 208-242 BL00162A 22.92 
8.7l4e-30 16^47 BLO0I62D 
15.06 7J58e-24 126-151 
BL00I62B 21.43 1.375e-19 51-74 


1568 


PR00457 


ANIMAL HAEM PEROXIDASE 
SIGNATURE 


PR00457E 20.67 1.621e-24 414- 
441 PR00457D 16.81 8.258S-21 
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entry ID 


Description 


Results* 






s 

0 


389-410 PR00457B 13.29 3,455e- 
18 223-239 PR00457G 17.45 
7,000e-l 8 595-616 PR00457C 
19.25 4.414e-l 6 371-390 
PR00457H 15.90 8.650e^l4 666- 
681 PR00457A 15.80 5.645e-12 
169-181 PR00457F 13.69 8.875e- 
1 1 467-478 


1569 


BLG1304 


ubiH/COQ6 monooxygcnase family proteins. 


BL01304A 8.05 3.571e-li 50^64 


1569 


PR00368 


FAD-DEPENDENT PYRIDINE 
NUCLEOTIDE REDUCTASE SIGNATURE 


PR00368A 17.76 3.769e-lO 50-73 


1569 


PR00757 


FLAVIN-CONTAINING AMINE OXIDASE 
SIGNATURE 


PR00757A 6.64 5.552e~l0 50-70 


1569 


BL00623 


GMC oxidoreductases proteins. 


BL00623A 12.60 2.929e-09 50-69 


1569 


PR00420 


AROMATIC-RING HYDROXYLASE 
(FLAVOPROTEIN MONOOXYGENASE) 
SIGNATURE 


PR00420A 14.78 6.455e-09 50-73 


1569 


BL00064 


L-lactate dehydrogenase proteins. 


BL00064A 21.16 7.203e-09 50-88 


1569 


PR00370 


FLAVIN-CONTAINING 
MONOOXYGENASE (FMO) SIGNATURE 


PR00370A 3.35 9.772e-09 50-66 


1571 


BL00019 


Actinin-typc actin- binding domain proteins. 


BL00019D 15.33 3.880e-17 145- 
175 


1573 


BL00893 


mutT domain proteins, 


BL00893 18.99 5.500e- 16 127- 
152 


1573 


PR00502 


MUTT DOMAIN SIGNATURE 


PR00502B 15.92 4.600e-13 138- 
154 PR00502A 15.06 2.636e-09 
124-139 


1574 


PF00632 


HECT-domain (ubiquitin-transferasc). 


PF00632B 18.45 7.000e-16 488- 
516 PF00632C 20,66 7.851e-14 
533-565 


1576 


PR00239 


MOLLUSCAN RHODOPS1N C- 
TERMTNAL TAIL SIGNATURE 


PR00239E 1.58 9.566e-10 292- 
304 


1576 


BL001I5 


Eulcaryotic RNA polymerase II heptapeptide 
repeat proteins. 


BL00115Z 3,12 5.632e-09 243- 
292 


1576 


DM00215 


PROLINE-RICH PROTEIN 3, 


DM00215 1 9 43 6 1 1 8e-l 1 296- 
329 DM00215 19.43 9.647e-Il 
327-360 DM00215 19.43 8.232e- 
10322-355 DM002 1 5 19.43 
Z068e-09 291-324 DM002 1 5 
19.43 2.983e-09 265-298 
DM00215 19.43 4.356e-09 292- 
325 DM002I5 19.43 7.712e-09 
275^308 DM00215 19.43 8.017e- 
09 266-299 DM00215 19.43 
8.475e^09 271-304 DM00215 
19.43 8.780e-09 286-3 19 


1582 


BL01280 


Glucose inhibited division protein A family 
proteins. 


BL01280A 15.97 6.727e-36 69- 
110 BL01280B 23.56 8.I05e-27 
128-180 


\m 


BL00076 


Pyridine nusleotide-disulphide 
oxidoreductases class-I. fr 


BL00076A 18.83 6.745e-l2 68-98 


1582 


BL00836 


Alanine dehydrogenase & pyridine nucleotide 
transhydrogenase. 


BL00836D 22.30 9.576e-12 69- 
106 


1582 


BL00504 


Fumaratc reductase / succinate dehydrogenase 
FAD-binding site proteins. 


BL00504A 10.76 3.870e~ll 69-91 


1582 


BL00977 


FAD-dependent glycero l-3-phosphate 
dehydrogenase proteins. 


BL00977A 20.76 8.583e-ll 69- 
121 
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NO: 


entry U> 






1582 


PR0041 1 


PYRFDINR M1JCLROTTDF DISTJT Pl-TTDR 
REDUCTASE CLASS-I SIGNATURE 




1582 


BL00982| 


Bacterial-type phytoene dehydrogenase 


BL00982A 18,41 2.151e-10 71- 
103 


1582 


PR00368| 


FAD-DEPENDENT PYRIDINE 
NUCLEOTIDE REDUCTASE SIGNATURE 


PR00368A 17.76 8.846e-13 69-92 
PR0036SC 15.74 5.263e-10 69-95 


1582 


PR0041 9 


SIGNATURE 






r JxUV / J / 


T?| A A/TNI C 1 HMT A rMlM A AKyfTKTK AY I Pi A CU 

SIGNATURE 








Jr I tviJUIllNli NUv*L#JtvWl JJJJq 1 J JLo U l^iYil LJl/ 

REDUCTASE CLASS-H SIGNATURE 


PDnft/t£QA 1 ^ 1 o<iai in rift 
rlwU40yA 1 3 .40 1 . oM e** 1 U oy-92 

PR00469F 16.51 8.063e-09 65-90 






vjivil* oxiuoreuuciases proteins. 


til nOCOQ A 11 /CA O CO£a Art £c\ OO 

WL,UUo^JA 1Z.0O o.Dooe-09 oy-08 


1586 


PR004I3 


HALOACID DEHALOGENASE/EPOXIDE 

UVTM>A1 ACtt VA\A1J V OI/"*T\J A TT TDT7 

HYXJKULA&& FAMILY MOINAlUKb 


PR00413E 15.78 6.7l4e-09 70-87 


1587 


PD01861 


PROTEIN NUCLEAR 
RIBONUCLEOPROTEIN SMALL MRNA 
KINA. 


PD01861A 14.06 6.318e-10 60-84 


1588 


PD00289 


PROTEIN SH3 DOMAIN REPEAT 


PD00289 9.97 6.586e-09 46-60 


1588 


PF00595 


PD2 domain proteins (Also known as DHR 

Or uLur ). 


PF00595 13.40 9.400e-09 43-54 


1591 


BL009I4 


Syntaxin / cpimorphin family proteins. 


BL00914 24.91 1.250e-29 184- 
234 


1592 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PROOOtOC 11.16 6.667e-ll 363- 
374 


1592 


BL01187 


Calcium-binding EGF-like domain proteins 
pattern proteins. 


BL01 187B 12.04 6.538e-16 358- 

374 BL01 1 87A 9.98 3.250e-09 
ito inn 


1592 


BL00022 


EGF-like domain proteins. 


BL00022B 7.54 8.200e-09 367- 
374 




JrKUUOZD 


TYKTA T DDATDTKr 1? A WAIT V C1YTM A TI 1"DT? 

UMAJ JrKUljblN TAMIL X olUNAl UKJb 


DDArt/lt A 10 OA A £f\f\^ OA t A A 

rK00625A 12,84 4.600e-20 14-34 


1593 


BL00636 


Nt-dnaJ domain proteins. 


BL00636A 8.07 4.176e-18 18=35 
BL00636B15.il L000e-1 5 46-67 






TP A XT CPU TPTTAM PP f\TPT~M TYkT A 

BIND1N. 


390 






BINDIR 


PhrtO/l/l ft A O 7*7 1 1 1 & in * a on 

PD02448B 10.17 8.071e-19 89- 


1602 


PR00403 


WW DOMAIN SIGNATURE 


PR00403B 12.19 9.816e~ll 144- 
103-118 


1602 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 6.644e-09 23-56 


1602 


BL01 159 




159 BL01159 13.85 6.891e-09 
103-118 


1602 


PR00571 


ENDOTHELIN-B RECEPTOR 
SIGNATURE 


PR00571G 5.36 7.750e-09 107- 
126 


1603 


PR00403 


WW DOMAIN SIGNATURE 


PR00403B 12.19 9.816e-ll 107- 
122 


1603 


BL01159 


WW/rsp5/WWP domain proteins. 


BL01 159 13.85 5.224e-l0 107- 
122 


1603 


DM00215 


PROLINE-RICH PROTEIN 3. 


DM00215 19.43 6.644e-09 23-56 


1605 


PR00929 


AT-HOOK-LIKE DOMAIN SIGNATURE 


PR00929B 4.38 4.600e-10 358- 
370 


1605 


DM00303 


6 LEA 1 1-MER REPEAT REPEAT, 


DM00303A 13.20 7.708e-10 306- 
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356 DM00303A 13.20 2.912e-09 
304-354 DM00303A 13.20 
7.212c-09 300-350 DM00303A 
13.20 7.212e-09 311-361 


10U-J 


DT AAOCyl 


nMO-l and HMG-Y DNA-bi tiding domain 
proteins (Ahook). 


BL00354B 3,16 7.722e-09 357- 
370 


loUo 


ruU2J79 


AMINOTRANSFERASE BIOSYNTHESIS 
PHOSPHOSERINE SER. 


PD02379E 11.43 1.000e-40 194- 
236 PD02379F 18.62 6.029e-35 
245-284 PD02379H 16.03 
5.235e-33 352-385 PD02379B 
12.05 3.61 3e-31 80-113 
PD02379A 15,57 2.800e«25 29-60 
PD02379C 13.34 3.700e-21 1 19- 
139 PD02379D 11.83 9.4l9e=16 
168-181 PD02379G 10.62 
2.537e-14 313-328 


1606 


PR00800 


AROM ATIC-L- AMINO-AC ID 
DECARBOXYLASE SIGNATURE 


PR00800G 11.29 1.889e-09 97- 
118 


1607 


PD02379 


AMINOTRANSFERASE BIOSYNTHESIS 
PHOSPHOSERINE SER. 


PD02379E 11.43 1.000e-40 194- 
236 PD02379F 18.62 6.029e-35 
245-284 PD02379B 12.05 3.613e- 
31 80-113 PD02379A 15,57 
2,800e-25 29-60 PD02379H 
16.03 7.864e-23 306-339 
PD02379C 13.343.70Oe-21 119- 
139 PD02379D 1 L83 9.419e-16 
I68-ISI 


1607 


rS\\]\Jo\J\J 


DECARBOXYLASE SIGNATURE 


PR00800G 11.29 1.889e-09 97- 
118 




JrJtwVO/ t r 


SIGNATURE 


rKUOo /4L 4.37 6,625e-09 33-48 


1614 


BL00035 


•POU' domain proteins. 


BL00035B 14.46 6.236e-09 683- 
704 


1616 


PF00168 


C2 domain proteins. 


PF00168C 27.49 8.412e-13 634- 
660 


1616 




\s£> \.J\)vJ\.t\\VA MvJiN/V 1 UivC/ 


PR00360B 13.61 8.1G5e-10 651- 
665 


1617 


PF00168 


C2 domain proteins. 


PF00168C 27.49 8.412e-13 115- 

141 


1617 


PR00360 


C2 DOMAIN SIGNATURE 


PR00360B 13.61 8.105c-10 132- 
146 


1619 


PR00566 


DOPAMINE IB RECEPTOR SIGNATURE 


PR00566E 13.44 5.255e-18 466- 
483 rK0U566A 9*32 3,000e-17 
200-214 PR00566D9.35 1.600e- 
12 446-455 PR00566C 11.44 
Z184e-12 401-412 PR00566B 
8 20 3 053e-ll 141^1 


1619 


PR00242 


DOPAMINE RECEPTOR SIGNATURE 


PR00242E 13.29 1.000e-I2 424- 
439 PR00242B 11.77 B.650e-ll 
257-267 


1619 


BL00237 


G-protein coupled receptors proteins. 


BL00237C 13.19 6.786e~20 364- 
391 BL00237A 27.68 9.71 Oe- 1 5 
266-306 BL00237B 5.28 5.263e- 
10 309-321 


1619 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY 
SIONATURE 


PR00237F 13.57 5.800e~19 369- 
394 PR00237B 13.50 6.250e-19 
236-258 PR00237E 13.03 9.500e- 
15 301-325 PR00237C 15.69 
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3.925e^09 280-303 PR00237A 
11.48X387e-09 202-227 


1620 


PR00169 


POTASSIUM CHANNEL SIGNATURE 


PROOllPA 16.77 7.851e-ll 46-66 


1621 


DM00215 


PROLINE-RlCH PROTEIN 3. 


DMOfxfis 19.43 7.529e- 11 183- 

™-i . 


1621 


PF00685 


Sulfotransfcrase proteins. 


PF0068SC 26.03 5.100e-09 118- 
164 


1621 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 8.322e-09 198- 
213 


1622 


BL00951 


ER lumen protein retaining receptor proteins. 


BL00951D 14.23 1.670e-09 43-74 


1623 


BL00292 


Cyclins proteins. 


BL00292B 20.31 3.925e-ll 120- 
151 


1624 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 3. l60e-09 111- 
125 


1624 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 3.739e-12 114- 
131 BL00028 16.07 3.571e-09 
145-162 


1625 


BL01226 


Hydroxymcthylglutaryl-coenzyme A synthase 
proteins. 


BL012261 25.06 8,560e-09 256- 
304 


1629 


BL00030 


Eukaryotic RNA-bindmg region RNP-1 
proteins. 


BL00030A 14.39 1.563e-I2 72-91 
BL00030A 14.39 2.125e-12 156- 
175 


1637 


BL0074O 


MAM domain proteins. 


BL00740B 19.76 3.8l3e-09 637- 
658 


1637 


PR00597 


GELSOLIN FAMILY SIGNATURE 


PR00597G 8.55 6.586e^27 637- 
660 PR00597A 12.96 5.846e~26 
^zo-jffo rxsAivDy/ct ij.'+o z.uuue- 

9.526e-22 582-602 PR00597D 
12 77 1 000e-20 469-490 
PR00597B 9.78 2.500e-20 415- 
432 PR00597C 14.19 6.1 92e-20 
436-455 PR00597H 15.32 7.577e~ 
19 666-686 PR00597D 12.77 
3.392e-10 94-115 PR00597B9.78 
9.455e-10 36-53 PR00597C 14.19 
7.875e-09 61-80 PR00597A 12.96 
8.027e~09 689-711 


1641 


BL00027 


'Homeobox* domain proteins. 


BL0OO27 26.43 7.000e-l 1 93-136 


1641 


PR00S87 


STRUCTURE-SPECIFIC RECOGNITION 
PROTEIN SIGNATURE 


PR00887D 15.12 8.909e-09 337- 
351 


1646 


PR00259 


TRANSMEMBRANE FOUR FAMILY 
SIGNATURE 


PR00259A9.27 3.308e-I8 19-43 
PR00259C 16.40 9.800e-18 88- 
117 PR00259D 13.50 2.756e-15 
238-265 


1646 


BL00421 


Transmembrane 4 family proteins. 


BL00421A UJ9 5,263e-I4 15-34 
BL00421E 20.97 4.632c-13 235- 
265 


1651 


PR00669 


INHIBIN ALPHA CHAIN SIGNATURE 


PR00669F 5.57 9.899e-09 223- 
241 


1652 


DM01292 


ESICULAR LUMEN DOMAIN. 


DM0I292L lfi.54 9.505e-09 240- 
265 


1653 


PR00128 


COLIPASE SIGNATURE 


PR00128D 9J7 6-250e-25 47-66 
PR00128C 9.28 5.299e-20 24-47 


1653 


BL00121 


Colipase proteins. 


BL00121B 9.96 3.160e-33 15-64 
BL00I21A 14.56 2.107e-09 16-56 


1656 


PR00O49 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 9.929e-10 384- 
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: j _ 


399 


1658 


BL01118 


Translation initiation factor SUI1 proteins. 


BL01 118B 26.75 8.579e-26 94- 
132 BL01 1 ISA 12,46 4.000e-13 
77-92 


1659 


BL00811 


Oleostns proteins. • 


BL0081 1 A 8.26 3.310e-09 120- 
158 


1660 


BL00674 


AAA-protein family proteins. 


BL00674B 4.46 9. 1 82e- 1 1 1 84-^ 
206 


1660 


PR00830 


endopephdase la (lon) serine 
protease (s 16) signature 


PR00830A 8.41 8.544e-10 191- 
211 


1660 


PR00300 


ATP-DEPENDENT CLP PROTEASE ATP- 
BINDING SUBUNIT SIGNATURE 


PR00300A 9.56 9.416e-09 187- 
206 


1660 


PR00051 


BACTERIAL CHROMOSOMAL 
REPLICATION INITIATOR (DNAA) 
SIGNATURE 


PR0005IA 10.68 9.899e-09 184- 
205 


1661 


DM01871 


kwSSRLIGASECYCLO 
FORMYLTETRAHYDROFOLATE. 


DM01871C 20,79 9.836e-10 270- 
296 


1663 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 7.71 2e-09 95-1 10 


1665 


BL01181 


Ribosomal protein S21 proteins. 


BL01181 15.43 2.500e-10 13-49 


1666 


PR00259 


TRANSMEMBRANE FOUR FAMILY 
SIGNATURE 


PR00259C 16,40 6.824e-16 88- 
117 PR00259A9>27 3.423e-14 
24-48 PR00259D 13.50 1.574e-i3 
238-265 PR00259B 14.81 8.714&- 
13 61-88 


1666 


BL00421 


Transmembrane 4 family proteins. 


BL00421B 17.62 4.600e-I9 67- 
106 BL00421E 20.97 6.21 
235-265 BL0042IA 11.79 
5.600e- 12 20-39 


1668 


PR00496 


NAPIN SIGNATURE ' 


PR00496A 6,68 6,276e^09 21-43 


1671 


PR00237 


RHODOPSIN-LIKE GPCR SUPERFAMILY 
SIGNATURE 


PR00237G 19.63 2.543e-l 1 670- 
697 PR00237A 11.48 3.000e-10 
424-449 


1671 


PR00373 


GLYCOPROTEIN HORMONE RECEPTOR 
SIGNATURE 


PR00373D 11.16 2.403e-09 503- 
518 


1671 


BL00237 


G-protein coupled receptors proteins. 


BL00237A 27.68 6>600e-l 0 496- 
536 BL00237D 11.23 4.545e-09 
680-697 


167! 


PR00910 


LUTEOVIRUS ORF6 PROTEIN 
SIGNATURE 


PR00910A2.51 7.429e-09400- 
413 


1671 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019B ll.36 6.000e-tl 94- 
108 PR00019A 11.19 7.300e-ll 
215-229 PR00019B 1 1.36 6.8506- 
10 46-60 PR00019A 11.19 
8.043e- 10 285-299 PR00019B 
1 1 .36 5.320e-09 21 2-226 
PR00019B 1 1.36 9.640e-09 70-84 


1672 


BL00972 


Ubiquitin carboxyl-terminal hydrolases 
family 2 proteins. 


BL00972A 1 1.93 7.500e-20 36-54 
BL00972D 22.55 6.806e-16 296- 
321 BL00972B9.45 1.000e-l3 
116-126 BL00972E 20.72 8.773c- 
12 321-343 


1673 


PF00646 


F-box domain proteins. 


PF00646A 14.37 6.906c09 92- 
106 


1675 


BL00933 


FGGY family of carbohydrate kinases 
proteins. 


BL00933D 24.01 7.545c-15 212- 
249 BL00933B 15.942.200e~09 
54-65 BL00933E 13.80 3,543e-09 
439-455 BL00933A 17.50 
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4.857e~09 20-44 


1676 


PR00876 * 


NEMATODE METALLOTHIONEIN 
SIGNATURE 


PR00876B 7.66 1.8S7e-10 137- 
151 


1676 


BL00026 1 


Chitin recognition or binding domain 
proteins. 


BL00026 12.95 2.776e-09 55-76 


1676 


BL01208 * 


VWFC domain proteins. 


BL01208B 15.83 2.946e-09 172- 
187 


1676 


BL01282 


BIR repeat proteins. 


BL012R2R 'W 4<5 d 471p-0Q no 
169 


1676 


PR00875 


MOLLUSC METALLOTHIONEIN 
SIGNATURE 


PR00875B S 24 4 64Qp-ftQ t 'Xl- 
145 


1676 


BL00956 


Fungal hydrophobic proteins. 


BL00956B 8.29 4.682e-09 153- 


1676 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE E2 
PRECURSOR PEPLOMER. 


PD00866L 3,73 6.564e-l0 1-1 1 
PD00866L 3.73 l.443e-09 33-43 

PD00866L 3.73 2.918e-09 19-29 
PD00S66L 3.73 4.836e-09 149- 


1676 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 6.288e-09 141~ 
169 


1676 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL-LI. 


DM01724 8.14 7.465e-10 17^37 
DM01724 8.14 4.434e-09 19-39 
DM01724 8.14 6.684e-09 10-30 


1676 


PR0085S 


CRUSTACEAN METALLOTHIONEIN 
SIGNATURE 


PR00858B 5.93 5.883e-09 155- 
174 PR00858B 5.93 8.085e-09 
136-155 


1676 


PR00874 


FTJNGI-IV METALLOTHIONEIN 
SIGNATURE 


PR00874C 4.37 9.739e-10 125- 
140 PR00874C 4.37 9.Q00e-O9 
135-150 


1676 


BL00243 


Integrins beta chain cysteine-rich domain 
proteins. 


BL002431 31.77 3.779e-10 38-81 
BL002431 31.77 4.309e-10 68-1 1 1 
BL002431 31.77 5.235e-10 58-101 
BL002431 31.77 7.353e~10 98-141 

BL002431 31.77 1.000e-09 88-131 
BL002431 31.77 1.380e-09 121- 
164 BL00243I 31.77 2.648e-09 
1 19-162 BLD0243I 31.77 3.662e- 
0961-104 BL002431 31.77 
4.296G-09 131-174 BL00243I 
3 1.77 4.676e-09 48-91 BL00243I 
31.77 6.704e-09 109-152 
BL002431 31/77 7,845e-09 25-68 
BL002431 31.77 9,366e-09 134- 
177 


1676 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 3.885e>-16 128- 
174 BL00203 13.94 8.607e-13 
123-169 BL00203 13.94 2.780e- 
11 153-199 BL00203 13.94 
3.571e-ll 148-194 BL00203 
13.94 4.363e-ll 113-159 
BL00203 13.94 5.451(^11 139- 
185 BL00203 13.94 6.934e-l 1 
144-190 BL00203 13.94 9.209e- 
11 13M77 BL00203 13.94 
2.436e-10 35-81 BL00203 13.94 
4.255e-10 133-179 BL00203 
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13.94 6.745e-l 0 32-78 BL00203 
1 3.94 7, 032e-l 0 154-200 
BL00203 13.94 2.929e-09 34-80 
BL00203 13.94 3,388e-09 149- 
195 BL0O203 13.94 3.571 e-09 
136-182 BL00203 13.94 5.224e- 
09 127-173 BL00203 13.94 
5.776e*09 43-89 BL00203 13.94 
6,878e^09 140-186 BL00203 
13.94 7.796e-09 45-91 BL00203 
13.94 9.541e-09 42-88 


1679 


PD01976 


KINASE DEHYDROGENASE 
TRANSFERASE. 


PD01976A 8.95 1.493e-09 83-96 


1680 


BL00623 


GMC oxidoreductases proteins. 


BL00623A 12.60 9.859e-10 12-31 


1680 


PR00419 


ADRENODOXIN REDUCTASE FAMILY 
SIGNATURE 


PR00419A 14.89 4.729e~09 12-35 


1680 


PR00368 


FAD-DEPENDENT PYRIDINE 
NUCLEOTIDE REDUCTASE SIGNATURE 


PR00368A 17.76 9.357e-09 12-35 


1683 


BL01172 


Ribosomal protein L44e proteins. 


BL01172B 14.10 8.909e-38 15-57 
BL01172C 16.78 7.188e-31 63- 
102 


1685 


DM01724 


kw ALLERGEN POLLEN CIM1 HOL-LI. 


DM01724 8.14 5.909e«ll 11-31 
DM01724 8.14 6.591e-l I 41-61 
DM01724 8. 14 6,S31e-10 39-59 
DM01724 8.14 8.697e-09 55-75 


1686 


PR00320 


G-PROTEIN BETA WD-40 REPEAT 
SIGNATURE 


PR00320A 16.74 8.463e-09 73-88 


1690 


PD02269 


CYTID1NE DEAMINASE HYDROLASE 
ZINC AMINOHY. 


PD02269C 16.367.882e-l7 79-92 
PD02269A 10.06 1.000e-15 29-41 
PD02269D 1 1.98 5.000e-I4 1 10- 
125 


1691 


BL00790 


Receptor tyrosine kinase class V proteins. 


BL00790D 12.41 8.297e-09 429- 
454 


1692 


BL00750 


Chaperonins TCP-1 proteins. 


BL00750B 16.17 2.000e-39 69- 
119 BL00750A 20.07 8.286e-36 
25-68 BL00750C 25.65 8.579e-23 
152-184 


1692 


PR00304 


TAILLESS COMPLEX POLYPEPTIDE 1 
(CHAPBRONE) SIGNATURE 


PR00304C 8,69 1.250e-18 86-106 
PR00304B 1 1.60 2.059e-17 56-75 
PR00304A 9.20 3.605e-15 34-51 


1692 


PR00298 


60 KD CHAPERONIN SIGNATURE 


PR00298B 13.59 7.353e-ll 88- 
116 


1692 


BL00296 


Chaperonins cpn60 proteins. 


BL00296B 15.98 4.1 15e-13 76- 
130 BL00296A 17.20 5.648e-10 
12-66 


1694 


BL00415 


Synapsins proteins. 


BL00415N 4,29 4.710c-10 225- 
269 


1694 


PD01234 


PROTEIN NUCLEAR BROMODOMAIN 
TRANS. 


PD01234B 15.53 5.875e-10 243- 
261 


1694 


BL00795 


Involucrin proteins. 


BL00795C 17.06 7,698e-10 213- 
258 


1694 


PR00208 


GLIADIN AND LMW GLUTENIN 
SUPERFAMILY SIGNATURE 


PR00208A 12.59 9 _384e-09 247- 
265 


1694 


DM00406 


GLIADIN. 


DM00406 7.73 9.800e-09 245-258 


1696 


BLO0028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 I.000e»09 212- 
229 BLO0028 16.07 6.143e-09 
365-3S2 
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1700 


BL00030 


Eukaryotic RNA-binding region RNP-1 
proteins. 


BL00030A 14.39 6.143e^]3 3$2- 
351 | 


1700 


PD02784 


PROTEIN NUCLEAR 
RIBONUCLEOPROTEIN, 


PD02784B 26.46 6.943e-09 44& 
485 f 


1701 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 8.500e-10 28* 
299 PF00023A 16.03 9.625e-10 
347-363 PF00023A 16.03 1.321e- 
09 184-200 PF00023A 16,03 
1.643e~09 150-166 


JL /vl 


ftt a o on a 

DLA}\Jj\JO 


Uroporphyrinogen decarboxylase proteins. 


BLUU906D 24.33 7.750e-09 212- 
256 


17A1 


PPAA70 1 


Domain present in ZO-1 and Unc5-like netrln 
receptors. 


PF 0079 IB 28.49 8.I59e-I4 117- 
172 PF00791B 28.49 8.31 9e- 13 
2.1 fTUWyiB Zo.49 3.1 79e- 
12 184-239 PF00791B 28.49 
D.lOoe-lz J4/-4UZ rrU0/91B 
28.49 5.727e-l 1 250-305 

DTTAA*7Q 1 13 OB AQ O Qt 7 n AG 1 *7 71 

PF00791B 28.49 8.514e-09 84- 
139 PF00791C 20.98 1.000e-08 
98-137 


1702 


RI 00107 


rruicin Kiiiaaca rvn -oicaing region proteins. 


JXl AA 1 A7R 1111 1 7o t A OAO 

218 


1702 


PF00992 


Trnnnnin 


Pf?AAQQ7 A 1 ft £7 0 ^o/^ AO 7 AO 

rrvuyyji/v 10,0/ y.ozoc-uy /*fy- 
784 


1708 


PR00671 


1NHIB1N BETA B CHAIN SIGNATURE 


PR00671C 4.18 8.966e-09 212- 
232 


1 7flQ 

1 / U7 


PROOFS 


SUBUNIT SIGNATURE 


rKUUO / 5JH y. 1 3 /.oUjC-Iz z9z- 
315 


1710 


PR004I2 




DP AA/1 1 OP 1 1 lA O A1 1 & 1 O f £0 

183 PR00412A 13.23 7.947e-12 

104-171 PRftft4l7Ft 19 W 7 47Qp 

10 123-139 


1711 


PR00217 


43 KD POSTSYNAPTIC PROTKTN 
SIGNATURE 


PRftn717P 10 01 7 747a 10 703 

309 


1712 


PR00320 


G-PROTEIN BETA WD-40 RFPPAT 
SIGNATURE 


PPOO^OA 16 74 Q 177p-00 777.. 
292 PR00320A 16 74 9 780e-09 
233-248 PR00320C 13.01 LOOOe- 
08 233-248 


1713 


BL01230 


RNA methyltransferese trmA family proteins. 


BL01230E 15.79 2.91 8e- 11 487- 
503 


1719 


BL00039 


DEAD-box subfamily ATP-dependent 
helicaaes proteins* 


BL00039D 21.67 2.957e-09 434- 
480 


1719 


DM01537 


kw SKI2W SKI2 NUCLEOLAR HELECASE. 


DM01537B 2L63 7.830e-09 408- 
455 


1721 


PR00453 


VON WILLEBRAND FACTOR TYPE A 
DOMAIN SIGNATURE 


PR00453A 12.79 2.957e~10 33-51 


1721 


PR00527 


GASTRIN RECEPTOR SIGNATURE 


PR00527I 5.36 6.559e-09 419-439 


1721 


PR00014 


FIBRONECT0M TYPE III REPEAT 
SIGNATURE 


PR00OI4C 15.44 6.870e-09 381- 
400 


1721 


PR00477 


PHOSPHOGLYCERATE KINASE FAMILY 
SIGNATURE 


PR004771 8.53 1 ( 000e-08 168-186 ' 


1725 


PR00493 


BREAST CANCER TYPE I 
SUSCEPTIBILITY PROTEIN SIGNATURE 


PR00493G 7.57 3.71 le-14 693- 
714 


1726 


BL00443 


Glutamine amidotransferases class-II 
proteins. 


BL00443F 16.68 8.7l4e-09 85- 
101 


1728 


DM01206 


CORONAVIRUS NUCLEOCAPSID 


DM01206B 10.69 7.288e-10 167- 



305 



WO 02/070539 



PCT/US02/05095 



SEQIB 
NO: 



Database 
entry II> 



Description 



Results* 



PROTEIN. 



187 



1728 



DM00215 



PROL1ME-RICH PROTEIN i 



DM002L5 19.43 3.41 le-10 331- 
364 DM00215 19.43 7.1 07e-10 
336-369 DM00215 19.43 9.679e- 
10 335-368 DM0021519.43 
3.136e-09 342-375 DM00215 
19.43 5.1 19e-09 315-348 
DM00215 19.43 8.322e-09 326- 
359 



1728 



PR00049 



WILM'S TUMOUR PROTEIN SIGNATURE 



PR00049D 0.00 8.627e-09 335- 
350 



1728 



BL00048 



Protamine PI proteins. 



BL00048 6, 
BL00048 6, 
BL00048 6, 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 
BL00048 6. 



39 5026e-10 
39 6.329e-10 
39 8,224e-10 
39 3,363e~09 
39 3.475e-09 
39 3.925e-09 
39 4.150e-09 
39 4.150e-09 
39 4.825G-09 
39 5.8386-09 
39 8.200e-09 
39 9,550e-09 



152- 179 
173-200 
161-188 
155-182 
163-190 
167-194 
15M78 
159-186 
171-198 

176- 203 

177- 204 

153- 180 



1728 



PR00211 



GLUTELIN SIGNATURE 



PR00211B0.86L827e-tl 345- 
366 PR00211B0.86 3.571e-ll 
339-360 PR002I IB 0.86 6.91 7e- 
09 325-346 PR00211B0.86 
1.000e-08 351-372 



1731 



BL01221 



PMP-22 / EMP / MP20 family proteins. 



BL01221C 2620 1.281e^34 59- 
104 BL01221D 13.99 5.966e-27 
136-163 BL01221A 17.26 
2.385e-26 1-29 BL01221B 13,29 
1.000e-14 38-52 



1733 



1733 



BL00027 



'Homeobox 1 domain proteins. 



BL00027 26.43 4.000e-10 297- 
340 



PR00024 



HOMEOBOX SIGNATURE 



PR00024A 1 1 .87 4. 1 50e-09 289- 
301 



1734 



BL00027 



'Homeobox 1 domain proteins. 



BL00027 26.43 4.000e-10 297- 
340 



1734 



PR00024 



HOMEOBOX SIGNATURE 



PR00024A 1 1.87 4.150e-09 289- 
301 



1738 



BL00303 



S-100/ICaBP type calcium binding protein. 



BL00303B 26.15 5.075e-l3 73- 
110 



1738 



BL00018 



EF-hand calcium-binding domain proteins. 



BL00018 7.41 3.400e-l2 85-98 
BL00018 7.41 8.043e-09 49-62 



1738 



PR00450 



RECOVERIN FAMILY SIGNATURE 



PR00450C 12.22 6.582e-09 44-66 
PR00450C 12.22 9-772e-09 80- 
102 



1740 



PD01941 



TRANSMEMBRANE COTRANSPORTER 
SYMP. 



PD01941C 19.96 4.960e-I6 84- 
139 PD01941B 15.02 2.093e-Il 
4-51 



1742 



BL00672 



Serine proteases, V8 family, histidine 
proteins^ 



BL00672B 9.84 3.554e-09 214- 
231 



1742 



1745 



PR00839 



V8 SERINE PROTEASE FAMILY 
SIGNATURE 



PR00839E 12.04 8.062e-09 213- 
230 



BL00674 



AAA-protein family proteins, 



BL00674B 4.46 7.814e-10 360- 
382 



1745 



DM01022 



LRR REPEAT. 



DM01022A 7.35 1.900e-09 954- 
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SEQ ID 
NO: 


Database 
entry ID 


Description 

* : 


Results* 








961 


1745 


PR00049 


iWILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 6. 1 86e-09 267- 
282 


1746 


BL00674 


BiAA-protein family proteins. 


BL00674B 4,46 7.814e-10 360- 
382 


1746 


DM01022 


Err REPEAT. 


DM0 1022 A 7 35 1 900e-09 954- 
961 


1746 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 6. 1 86e-09 267- 
282 


1747 


BL00215 


Mitochondrial energy transfer proteins. 


BL00215A 15.82 9.591e-18 206- 
231 BL00215A 15.82 4.0006-15 
104-129 BL00215A 15.82 
9.400e*15 7-32 BL00215B 10.44 
1 000e-10 154-167 


1747 


PR00927 


ADENINE NUCLEOTIDE 
TRANSLOCATOR t SIGNATURE 


PR00927G 11.079.036e~ll 158- 
174 PR00927B 14.66 4.652e-10 
239-261 


1747 


PR00926 


MITOCHONDRIAL CARRIER PROTEIN 
SIGNATURE 


PR00926F 17.75 2.826e-G9 9-32 
PR00926F 17.75 3.217e-09 208- 
231 


1749 


BL01013 


Oxysterol-binding protein family proteins. 


BL010I3A 25.14 5.500e-21 537- 
573 BL01013D 26.81 2.161e-18 

cn7-.ft*;i ht ai 01 sr* a on a on «. 
ou'-opI HLU1U13L- y.y/ 4.z31e- 

13 625-635 BL01013B 11.33 

3.01 7e-ll 603-614 


1751 


BL00711 


Lipoxygenases iron-binding region proteins. 


BL007111 18.56 8.630e-28 577- 

414-451 BL00711G 21.83 
9.100e-22 503-535 BL00711C 

90 75 5 OSOpuI O 9fiR 707 
BL00711D 17.56 L923e=16 347- 
373 BL0071 1H 23.34 L771e-12 
535-574 BL0071IF 19.79 2.086e- 
10 484-501 


1751 


PR00087 


LIPOXYGENASE SIGNATURE 


PR00087C 15 00 1 lJt4f>-l7 4?"?- 
444 PR00087A 18.37 7.061e-12 
385-403 PR00087B 15.25 5.091 e- 
10 403-421 


1751 


PR00467 


MAMMALIAN LIPOXYGENASE 
SIGNATURE 


PR00467E 9.00 3.400e-14 344- 
364 PR00467D 16.69 4.082e-09 
243-265 


1753 


PR00492 


RHO PROTEIN GDP DISSOCIATION 
INHIBITOR SIGNATURE 


PR00492C9.68 1.900e-23 122- 
139 PR00492B 9.77 8.579e-23 
76-95 PR00492D 14.82 8.200e-21 
139-155 PR00492A 11.92 L643e- 
18 60-76 


1756 


BL00378 


Hexokinascs proteins. 


BL00378A 19.01 8.500e-09 403- 
431 


1757 


BL00027 


'Homeobox' domain proteins. 


BL00027 26.43 8.615e-33 35-78 


1757 


BL00032 


'Homeobox* antennapedia-type protein. 


BL00032B 10.83 4.259e-27 24-63 
BL00032C 11.28 5.909e-20 63-81 


1757 


PR00025 


HOMEOTIC ANTENNAPEDIA PROTEIN 
SIGNATURE 


PR00025B I L94 4.0006-11 21-37 


1757 


PR00031 


LAMBDA AND OTHER REPRESSOR 
HELIX-TURN-HELIX SIGNATURE 


PR00031B 16.29 4.960e- 11 58-75 


1757 


PR00024 


HOMEOBOX SIGNATURE 


PR00024C 7.49 9.357c-13 67-77 
PR00024B 11.27 3.500e~ 11 57-68 
PR00024A 11.87 9.400e-lt 42-54 
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SEQ ID 


Database 


Description 


Results* 


NO: 


entry ID 






1758 


PR00179 


L1POCALIN SIGNATURE 


PR00I79B 9.56 l,000e-12 102- 
115 PR00179C 19-02 1.000e-10 
130-146 PR00179A 13,78 5,680e~ 
10 37-50 


1758 


BL00213 


Lipocalin proteins. 


BL002I3B 8.78 8.000e-I0 102^ 
133 BL00213A 12.95 9.526e-I0 
37-51 


1759 


BL00290 


Immunoglobulins and major 
histocompatibility Complex proteins. 


BL00290A 20.89 l,818e-ll 164- 
187 


1762 


BL00269 


Mammalian defensin^ proteins. 


BL00269C 16.52 7.158e-09 17N 
200 


1762 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 5.855e-10 57-85 
PD02283C 17.54 5.855e~10 87- 
115 PD02283C 17.54 6.566* 10 
117-145 PD02283C 17.54 L450e- 
0947-75 PD02283C 17.54 
L450e-09 77-105 PD02283C 
17.54 1.450e-09 107-135 
PD02283C 17.54 5.61 3e-09 67-95 
PD02283C 17.54 5.613e-09 97- 
125 PD02283C 17.54 6.1 75e-09 
137-165 PD02283C 17,54 7.525e- 
09 37-65 PD02283C 17.54 
8,875e-09 147-175 


1762 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13,94 3.379e~12 95-141 
BL00203 13.94 3.690e-12 65-1 ) I 
BL00203 13.94 2.978e=ll 35-81 
BL00203 13.94 5.549e-l 1 39-85 
BL00203 13.94 6.538e-l I 55-101 
BL00203 13,94 6.5380-11 85-131 
BL00203 13.94 7.23 le-11 34-80 
BL00203 13.94 Z429e-ll 125- 
171 BL0020313.94 7.527e-U 
69-115 BL00203 13.94 8.220e-l 1 
99-145 BL00203 13.94 L670e-10 
64-110 BLO02O3 13.94 2,053e- 10 
94-140 BL00203 13.94 2.1 49e- 10 
124-170 BL002O3 13.94 2.819e- 
10 159-205 BL00203 13.94 
5.213e-10 54-100 BL00203 13.94 
5.2I3e-10 84-130 BL00203 13.94 
5.691e-10 59-105 BL00203 13,94 
5.691e-10 89-135 BL0O203 13.94 
6.936e-10 129-175 BL00203 
13.94 7,51 Ie-10 115-161 
BL00203 13.94 7.702e-10 49-95 
BL00203 13.94 7.702e-10 79-125 
BL00203 13.94 1.55 1M)9 44-90 
BL00203 13.94 1. 55 le-09 74^1 20 
BL00203 13.94 3,1 12e-09 29-75 
BL00203 13.94 4.03 le-09 30-76 








BL00203 13.94 4.214e-09 90-136 
BL0O2O3 13.94 4.306e-09 60-106 
BL00203 13.94 5.133e~09 145- 
191 BL00203 13.94 6.235e-09 
119-165 BL00203 13.94 6.327e- 
09 50-96 BL00203 13.94 6.327e- 
09 80-126 BL00203 13.94 
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SEQ ID 
NO: 


Database 
entry ID 


Description 


Results* 








6.694e-094^91 BL00203 13.94 
6.694e-09 75*121 BL00203 13.94 
8.898e-0910Ll5O 


1763 


BL00216 


Sugar transport proteins. 


BL00216B 2§64 5.846e-09 141- 
191 | 


1766 


BL00456 


Sodium :solute symporter family proteins. 


BL00456A 22T59 2.080e-30 83- 
138 BL00456C 24.55 3.72 le-29 
221-276 BL00456B 18,94 l.OOOe- 
22 159-189 


1766 


PR00175 


SODIUM/ALANME SYMPORTER 
SIGNATURE 


PR0017SB 10.80 9.878e-09 226- 
245 


1767 


BL00I42 


Neutral zinc metallopeptidases, zinc-binding 
region proteins. 


BL00142 8.38 1.857e-09 494-505 


1768 


BL00509 


Ras GTPase-activating proteins. 


BL00509B 10,28 1.643e-12 610- 
621 


1772 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 6.l43e-13 252- 
266 PR00048A 10,52 7.429e-13 
476-490 PR00048A 10.52 3.1 18e- 
12 336-350 PR00048A 10.52 

10.52 4.706e-l2 504-518 
PR00048A 10.52 8.4l2e-12 224- 
238 PR00048A 10.52 3.842e-l I 
392-406 PR00048A 10.52 6.21 le^ 
11 308-322 PR00048A 10.52 
6.21 le-1 1 448-462 PR00048B 
6.02 7.23 le- 11 492-502 
PR00048B 6.02 3.250e^l0 240- 
250 PR00048A 10.52 6.870e- 10 
420-434 PR00048B6.02 2.421e- 
09 380-390 


1772 


PD00066 

* J-/ www 


PROTRTN ZINf-FINGFK MFTAI -RINDI 


340 PD00066 13.92 1.50Qe-13 
4H-424 PD00066 13.92 5.500e- 
13 383-396 PD00066 13.92 
5,500e-l 3 439-452 PD00066 
13.92 7.500e-l 3 495-508 
PD00066 13.92 9.000e-13 467- 
480 PD00066 13.92 3.571e-12 
355-368 PD00066 13.92 7.000e- 
12 27J-284 PD00066 13.92 
7.923e- 1 0 299-3 1 2 PD00066 
13.92 2.500e-09 243-256 


1772 


BL00028 


Zinc Finger, C2H2 type, domain proteins. 


BL00O28 16.07 3.250e-13 395- 
412 BL00028 16.07 5.950e-l3 
367-384 BL00028 16.07 6.478e- 
12 451-468 BL00028 16.07 
8.435e- 12 339-356 BL00028 
1 6.07 1.692e-l I 255-272 
BL00028 16,07 3,769e-ll 227- 
244 BL00028 l6-07 5.154e-ll 
507-524 BL00028 16.07 2.200e- 
10 479-496 BL00028 16.07 
9.400e-10 199-216 BL00028 
16.07 2.029e-09 423-440 
BL00028 16,07 3.571 e-09 311- 
328 


1773 


PR00122 


VACUOLAR ATP SYNTHASE 16 K.D 


PR00122D9.97 7.214e-ll 103- 
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SEQ 10 
NO: 


Database 
entry ID 


Description 


Results* 






SUBUNIT SIGNATURE * 


127 PR00122C 8.20 9.526e-I0 
76-103 


1773 


BL00605 


ATP synthase c subunit proteins. Ik 


BL00605 27.67 4.977e-09 70-124 


1774 


BL005I8 


Zinc finger, C3HC4 type (RING fingft), 
proteins. • 


BL00518 12.23 6.571e-10 30-39 


1776 


BL01160 


Kinesin light chain repeat proteins. 


BL01160B 19.54 L610e-09 33-87 
BL01160B 19.54 9,61 9e-09 65- 
119 


1783 


PD0241 1 


PROTEIN TRANSCRIPTION 
REGULATION NUCLEAR 


PD02411 21.89 6.786e-15 3967- 
4001 


1783 


PF00856 


SET domain proteins. 


PF00856B 16.42 6.595e-19 3949- 
3971 PF00856A 26 14 4. t7Sf»..17 
3896-3933 


1783 


PF0062R 


PHD-fineer 


PF00628 15.84 7.750e~I0 38-53 
PF0062R IS 84 5 645e-09 164-170 


1783 


BL001I5 


Eukaryotic RNA polymerase II heptapeptido 
reneal oroteins 


BL001I5Z 3.12 4.971e-09 2575- 
2624 BLOOt 1 SZ 12 7 7S0e-OQ 
2582-2631 


1784 


DM01970 


0 kw ZK632 12 YDR313C ENDOSOMAL 


DM01970B 8 60 9 423e-10 111- 
124 


1785 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10 527 I74e-10 177- 
191 


1790 


PF00922 


Vftfllculrtvini*! nhninhnnmtfiti 


PF00922A 19 17 7 7?dp-00 27fi- 
310 


1792 


PD02059 


CORE POLYPROTEIN PROTEIN GAG 
CONTAINS: P. 


PD02059A 28.10 5.950e-10 34-75 


1794 


BL00326 


Tropomyosins proteins. 


BL00326D 8.76 8.065e-09 165- 
206 


1795 


BL00326 


Tropomyosins proteins. 


BL00326D 8.76 8.065e-09 173- 
214 


1797 


PR00563 


BETA-3 ADRENERGIC RECEPTOR 
SIGNATURE 


PR00563B 3.98 8.141 e-09 8^28 


1799 
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1801 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDL 


PD00066 13.92 8.800e-14 290- 
303 PD00066 13.92 4.000e-13 
234-247 PD00066 13.92 4.429e- 
12 262-275 PD00066 13.92 
9.21 7e-l 1 206-219 PD00066 
13.92 3.769e-10 505-518 
PD00066 13.92 4.1 15e-10 449- 
462 PD000G6 13.92 4.462e-10 
533-546 PD00066 13.92 6.538e- 
10 477-490 


1801 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 5.09le-15 243- 
257 PR00048A 10,52 7.750e- 14 
542-556 PR0U048A 10.52 3.647e- 
12 215-229 PR00048A 10.52 
4J76e-12 486-500 PR00048B 
6.02 6.000e-12 231-241 
PR00048B 6.02 6.000e-12 287- 
297 PR00048A 10.52 7.353e- 12 
187-201 PR00048A 10.52 6.684e- 
11 271-285 PR00048A 10.52 
4.130©- 10 299-313 PR00048A 
10.52 3.520e><09 430-444 
PR00Q48A 10.52 3.880e-09 514- 



310 



WO 02/070539 



PCT/US02/05095 



SEQ ID 
NO: 


Database 
entry IP 


Description 


Results* 






— r— ■ — 


528 PR00048A 10.52 6.400e-09 
458-472 PR00048A 10.52 &560e~ 
09 159-173 


1801 


BL00028 


Ziii^fingei\ C2H2 type, domain proteins, 
V 


BL00028 16.07 1.563c-I5 545- 
562 BLO0028 16.07 L450e-13 
218-235 BL00028 16,07 3.348e- 

lz4oyOUO dL.V\j[)Z& 10.U7 

7>652e-12 274-291 BL00028 

1 A f!7 O 10^»_1 1 All AGf\ 

BL00028 16.07 4.1 15e-l 1517- 

JJ*t DLUUU^O IO.U/ J. I^fC-il 

246-263 BL00028 16,07 LOOOe- 
5,200e~10 461-478 BL00028 
BL00028 16.07 1.257e-09 357- 
162-179 


1802 


BL0O615 


C-type lectin domain proteins* 


155 


1802 


PR00770 


EOSINOPHIL MAJOR BASIC PROTEIN 
SIGNATURE 


PR00770F 13.79 1.774e-09 198- 
218 


1802 


PR00356 


TYPE II ANTIFREEZE PROTEIN 
SIGNATURE 


PR00356B 14.85 2.521e-09 137- 
155 


1803 


BL00615 


C-type lectin domain proteins. 


BL00615A 16.68 8.920e-l 1 176- 
194 


1803 


PR00770 


EOSINOPHIL MAJOR BASIC PROTEIN 
SIGNATURE 


PR00770F 13.79 1.774e-09 237- 

257 


1803 


PR00356 


TYPE II ANTIFREEZE PROTEIN 
SIGNATURE 


PR00356B 14.85 2.52Ie-09 176- 
194 


1804 


BL00018 


EF-hand calcium-binding domain proteins. 


BL000I8 7.41 9.000e-14 65-78 


1806 


PR00019 


LEUCINE-RICH REPEAT SIGNATURE 


PR00019B 11.36 LOOQe-09 219- 
233 


1808 


PD02474 


SYNTHASE SMALL SUBUNIT 
ACETOLACT. 


PD02474B 21.08 8.568e-09 199- 
238 


1809 


PF00922 


Vesiculovirus phosphoprbtein. 


PF00922A 19.17 L000e-08 249- 
283 


1812 


PR00289 


DISINTEGRIN SIGNATURE 


PR00289B 1L79 1.947e-09 522- 
535 


1814 


PF00242 


DNA polymerase (viral) N- terminal domain 
proteins. 


PF00242F 12.18 8.522e-09 197- 
219 


1815 


PR00780 


LEUSERPIN 2 SIGNATURE 


PR00780B 4.89 4.491e-09 262- 
285 


1816 


BL00226 


Intermediate filaments proteins. 


BL00226D 19.10 8.027e-I3 208^ 
255 


1817 


PD01876 


ANTIGEN MELANOMA-ASSOCIATED 
MULTIGENE FAMILY TUM. 


PD01876C 21.73 3.326e-15 481- 
534 PD01876C 21.73 3.045e-10 
735-788 


1818 


PR00747 


GLYCOSYL HYDROLASE FAMILY 47 
SIGNATURE 


PR00747C 12.06 8.767e4)9 337- 
356 


1820 


DM01782 


HYDROQENASE (FE) LARGE CHAIN. 


DM01782C 13.88 4.400e^l9 349- 
368 DM01782F9.01 4.375e-18 
499-515 DM01 782B 17.29 
3.412e~10 294-327 


1821 


BL00226 


Intermediate filaments proteins. 


BL00226D 19.10 7.375e-38 321- 
368 BL00226B23,86 7.l07e-32 
155-203 BL00226C 13.23 3.100c- I 
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S15QID 
NO: 


Database 
entry ID 


Description 


Results* 








19 220-251 BL00226A 12.77 
7.000e-15 55-70 BL00226D 
19.10 7.800e-09 254-301 


1822 


DM01415 


6 SALIVARY GLUE PROTEIN. 


DM01415B 13.78 9.51 8e-10 4-52 


1822 


BL00026 


rMiitin rppn^nifinn fit hirirlma Hnmain 

proteins. 




1822 


PD02283 


PROTEIN SPORULATION REPEAT 
PRECU. 


PD02283C 17.54 3.588e-09 1 10- 
138 PD02283C 17.54 3.588e-09 

1 1/10 


1822 


PD00866 


GLYCOPROTEIN PROTEIN SPIKE E2 


PD00866L3.73 1.443e-09 21-31 
rDUUoooL J. /J z./70e-09 97-107 
PD00866L 3,73 2.770e-09 1 46- 

130 rUUUoooJL $.15 z.yl 5S-U9 7- 
17 PD00S66L 3.73 2.91 8e-09 14- 
24 PD00866L 3,73 4.54le-09 4- 
14 


1822 


PR00858 


CRUSTACEAN METALLOTHIONEIN 
SIGNATURE 


PR00858B 5.93 2.819e-09 84-103 
PR00858B 5.93 5.021 e-09 1 14- 
133 PR00858B 5.93 5.021e-09 
124-143 


1822 


PR00875 


MOLLUSC METALLOTHIONEIN 
SIGNATURE 


PR00875B 5.24 6.595e-09 85-93 


1822 


PR00874 


FUNGI4V METALLOTHIONEIN 
SIGNATURE 


PR00874C 4.37 9.739e-10 73-88 
PR00874C 4,37 7.250e-09 83-98 


loll 




Integrins beta chain cysteine-rich domain 
proteins. 


BL0O2431 31.77 3.143e-U 13-56 
BL002431 31.77 3.647e- 10 26-69 
BL002431 31.77 6.426e- 10 106- * 
149 BL00243I 31.77 7.088e-10 
96-139 BL00243I31.77 9.338e- 
10 36-79 BL00243I3L77 L254e- 
09 46-89 BL002431 31.77 6.451 e- 
09 3-46 BL002431 3 1 .77 6.704e- 
09 77^120 BL00243I 31.77 
7.211e-09 67^10 BL00243I 
31.77 7.592e-091I6-IS9 
BL002431 31.77 8.606e*09 92-135 


1822 


BL00198 


4Fe-4S ferredoxins, iron-sulfur binding 
region proteins. 


BL00198 10.43 9.700e-09 6-18 


1822 


BL00203 


Vertebrate metaliothioneins proteins. 


BL00203 13.94 8.024C-14 102- 
148 BL00203 13.94 7,750e-I3 
127-173 BL00203 13.94 l.103e- 
12 97-143 BL00203 13.94 
3. 172e-12 103-149 BL00203 
13.94 3.379e-12 92-138 BL00203 
13.94 4.207e-12 98-144 BL00203 
13,94 4,207e-I2 116-162 
BL0D203 13 94 5 345e-l? 71-1 17 
BL00203 13.94 5.345e~12 L07- 
153 BL00203 13.94 9.897e- 12 
106-152 BL00203 13,94 l.791e- 
U U8-164 BL00203 13.94 
2.879e-ll 126-172 BL00203 
13.94 3.176e-11 87-133 BL00203 
13.94 4.758e-ll 61-107 BL00203 
13.94 5.846e-.il 113-159 
BL00203 13.94 6.044e-Il 112- 
158 BL00203 13.94 7.23 le- 11 
93-139 BL00203 13,94 1.287e-10 
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entry ID 
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128-174 BL00203 l%.94 2.245e- 
10 76-122 BL00203J3.94 
3.202e^l 023-69 BL|p203 13.94 
4.926e-l 078-124 Blf)0203 13.94 
6.362e-10 81-127 BI$)02O3 13.94 
6.553e~l 0 82-128 BL00203 13.94 
6.840e-10 111-157 BL00203 
13.94 8.851e-l0 43-89 BL00203 
13.94 8.851e-l0 96-142 BL00203 
13.94 2.837e-09 8»54 BL00203 
13.94 3.296e-09 72-1 18 BL00203 
13.94 3.847e-09 117-163 
BL00203 13,94 3.939e-09 86-132 
BL00203 13,94 5.592&-09 123- 
169 BL00203 13.94 5.776e-09 
108-154 BL002O3 13.94 6.1 43e- 
0930-76 BL00203 13.94 6.1 43e- 
09 79-125 BL00203 13.94 
6.969e-09 121-167 BL00203 
13,94 7,612e-09 16-62 BL00203 
13*94 7.796e-09 I0M47 
BL00203 13.94 8.163C-09 33-79 
BL00203 13.94 9.633e-09 77-123 

dt nnnni 11 o./i 1 nnn™ ad hc 111 
JtJLUU-AUJ 1J.94 l.UUUe-Uo 66-112 


1824 


PR00860 


VERTEBRATE METALLOTHtONEfN 
SIGNATURE 


PR00860B 7.04 2.929e-20 74-88 
PR00860A 5.46 5.655e-13 52-65 
PR00860C 9.61 2.400e-12 88-98 




i IvUUoJo 


SIGNATURE 


PRUU858B 5.93 l,4iye-ll 70-89 
PR00858B 5.93 7.070e-l 1 65-84 


1 R7j1 
1 (Wr 


r KvUO / *t 


r UrsOl-l V Mt 1 AL.LU 1 HlUNblfN 

SIGNATURE 


PR00874C 4.37 3,478e-10 64-79 


1824 




iniqgnns oum drain cystcinc^ncn aornain 
proteins. 


,pL*UUZ4Jri l/.ji /.o/De-iU 59-85 
BL002431 31.77 4.803e-09 65-108 


1 OAt 


PT}ftnft7ri 


■MPNJATAnP A/tPTAT I fYTTJTAMCMM 
lNxiMA I \JU& IVJlJc- 1 ALLU I rllUJN JilIN 

SIGNATURE 


JrKUUo fOU 0, / 1 £. I y 1 C- 1 0 62-75 
PR00876A 6,60 5.886e-09 61-74 


1824 


BL00203 


Vertebrate metallothioneins proteins. 


BL00203 13.94 I ,Q00e-40 62-108 

HT f\f\im 1 1 Qd. 7 A.'JQe* HQ <A 1 An 
DJuUVZUJ I J .7^ /r*rA7e-Uy 

BL00203 13.94 8.071e-09 52^98 
BL00203 13.94 8.806(^09 55-101 


1824 


PR00875 


MOT .1,1 JSC MRTAF J/YTHTONRTN 
SIGNATURE 




1825 


PR00360 


C2 DOMAIN SIGNATURE 


586 


1825 


PR00399 


SYNAPTOTAGMIN S1GNATIJRF 


PRfinlQQA 0 S7 ft R71p no irin 

jri\vUJ77/\ O.O/JC-l/y JOU- 

376 


1829 


DM0 1 206 


CORONAVIRUS NUCLEOCAPSID 
PROTEIN. 


DM01206B 10.69 8.767e-10 567- 
587 DM01 206B 10.69 t.000e-09 
563-583 


1829 


PD01351 


PROTEIN REPEAT NEUROFILAMENT 
TRIPL 


PD01351B 13.72 6.786e-10 196- 
222 PD01351B 13.72 2.597e-09 
198-224 


1829 


BL00035 


'POU' domain proteins. 


BL00035B 14.46 3.127e-09 634- 
655 


1829 


BL00229 


Tau and MAP proteins tiibu I in-binding 
domain proteins. 


BL00229A 23.57 3.1 82e-09 178- 
217 


1829 


PR00049 


WILM'S TUMOUR PROTEIN SIGNATURE 


PR00049D 0.00 3.647e-l 1 204- 
2 1 9 PR00049D 0.00 9.47 1 e- 1 1 
209-224 PR00049D O.OO 8.500e- 
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* 


10 206-221 PR00049D0.00 
8.500G- 10 207-222 PR00049D 
0.00 3.746e-09 182-197 


1829 


BL003 19 


Amyflidogenic glycoprotein extracellular 
domajhi proteins. 


BL00319C 17.12 3.132e-09 265- 
299 BL00319C 17.12 4.553e-09 
1013-1047 BL00319C 17.12 
5.6l8e-09 1019-1053 BL00319C 
37.12 7.395e-09 267-301 
BL00319C 17.12 7.632e-09 1017- 
1051 


1829 


DM00215 


PROUNE-RICH PROTEIN 3. 


DM00215 19,43 3.089e-10 185- 
218 DM00215 19.43 8.393e-10 
190-223 DM00215 19.43 2,373e- 
09 186-219 DM00215 19.43 
7J02e-09 183-216 DM00215 
19.43 8. 169e-09 188-221 


1829 


BL00422 


Gran ins proteins. 


BL00422C 16.18 8.588e-09 263- 
291 


1829 


BL00412 


Neuromodulin (GAP-43) proteins. 


BL00412D 16.54 9>182e-ll 1005- 
1056 BL00412D 16.54 9.120e-10 
1004-1055 BL00412D 16.54 
4.857e-09 1003-1054 BL00412D 
16.54 8.347e-09 1008-1059 
BL00412D 16.54 9.449e-09 1001- 
1052 


1829 


PR00832 


PAXILLIN SIGNATURE 


PR00832B 9. 87 9.526e-09 377- 
401 


1829 


PD02059 


CORE POLYPROTEIN PROTEIN GAG 
CONTAINS: P. 


PD02059B 24,48 9.620e-09 196- 
231 


1829 


PF00992 


Troponin. 


PF00992A 16.67 9.882e-09 1005- 
1040 


1829 


BL00048 


Protamine PI proteins. 


BL00048 6.39 6.949e-15 569-596 
BL00048 6.39 I.885e-14 56S-595 
BL00048 6.39 3.361e-14 570-597 
BL00048 6.39 8.377e-14 577-604 
BL00048 6.39 8.377e-14 578-605 
BL0004S 6.39 3.631e-13 571-598 
BL00048 6.39 4.738e-13 576-603 
BL00048 6.39 7.369e-13 582-609 
BL00048 6.39 2.456e-12 575-602 
BL00048 6.39 3.118e-12 573-600 
BL00048 6.39 3.515e-12 567-594 
BL00048 6.39 5.235e-12 581-608 
BL00048 6.39 5.632e=12 583-610 
BL00048 6.39 7.221 e- 12 557-584 
BL00048 6.392.250e-ll 572-599 
BL0U04S 6.39 2.875e-l 1 585-612 
BL00048 6.39 4.875e-l 1 586-613 
BL00048 6.39 5.375e-l 1 591-61 8 
BL00048 6.39 7.375e-ll 589-616 
BL00048 6.39 7.500e-l 1 580-607 
BL00048 6.39 8,625e~ll 588-615 
BL00048 6.39 2.895e-10 563-590 
BL00048 6.39 5.382e-10 574-601 
BL00048 6.39 5.500e-10 566-593 
BL00048 6.39 5.618e-10 584-61 1 
BL00048 6.39 9.171e 10 561-588 
BL00048 6.39 9.882e-10 592-619 
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BL00048 6.39 1.450e-09 590-617 
BL00048 6.39 3.925e-09 560-587 
BL00048 6.39 4,488e-09 562-589 
BL00048 6,39 4.938e-09 579-606 
BL00048 6,39 5.275e-09 564-591 
BL00048 6.39 5.725e-09 558-585 
BL00048 6.39 5.725e-09 587-614 
BL00048 6.39 6,625e-09 555-582 
BL00048 6.39 7.075e-09 556-583 
BL00048 6.39 9.438e-09 559-586 
BL00048 6.39 9,888e-09 600-627 


1829 


PD01234 


PROTEIN NUCLEAR BROMODOMAIN 

TT> A VIP 

TRANS. 


PD01234B 15.53 L000e-08 201- 
219 


1830 


BL00092 


N-6 Adenine-specific DNA methylases 

proteins. 


BL00092 5.35 2.000e-09 136-145 


1831 


PR00511 


TEKTIN SIGNATURE 


PR005I1A 13.59 3.700e-l4 1 13- 
130 


1833 


PR00764 


COMPLEMENT C9 SIGNATURE 


PR00764F 16.89 2.286e-09 158- 
179 


1833 


BL00022 


EOF-like domain proteins. 


BL00022B 7.54 4.600e-09 138- 
145 


1833 


BL01187 


Calcium-binding EGF-like domain proteins 
pattern proteins* 


BL01187B 12.04 4.086e-10 167- 
1 83 BLO 1 1 87B 12.04 4.600e-09 
104-120 


1833 


BL01185 


C-terminaL cystine knot proteins. 


BL01185B 21.14 6.929e-09 1 46- 
195 


1833 


PR00010 


TYPE II EGF-LIKE SIGNATURE 


PR000I0C 11.16 7.429e-09 109- 
120 PR00010C 11.16 7.643e-09 
172-183 PR00010All.79 8.846e- 
09 190-202 


1833 


PR00011 


TYPE IE EGF-LIKE SIGNATURE 


PR00011A 14,06 4,822e~09 208- 
227 PR0001 ID 14,03 8.957e-09 
67-86 


1833 


BL00243 


Integrtns beta chain cysteine-rich domain 
proteins. 


BL002431 31.77 1.000e-08 34-77 


1835 


BL01279 


Protein-L-isoaspartate(D-aspartate) 0- 
methyl transferase sigtia. 


BL01279A 24.27 3.691e~09 419- 

467 


1835 


BL0U31 


Ribosomal RNA adenine dimethylases 
proteins. 


BL01131A 26.62 4.600e-Q9 421- 
467 




BL01230 


RNA methyltransrerase trmA family proteins. 


BLO 1 23 0E 15.79 6.607e-Il 571- 
587 BL01230A 17,88 8.962oI0 
40y-4^8 oLOlzJUB 11.62 8.475e- 
09 436-449 


1 ODD 


rKUUU4y 




DID (\f\C\A C\T\ A AA O nttf\^ AA f AO 

rKU0U4yU O,00 o.7o0e-0y 59o- 
613 


1837 


PD01719 


PRECURSOR GT YmPROTFfW ^THNAT 
RE. 


ryyi ij,.oy z.ova?c-j i Ajy~ 
287 PD01719A 12.89 8.105e-10 
199-227 


1838 


BL01162 


Quinone oxidoreductase/ zeta-crystallin 
proteins. 


BL01162C 22.80 1.269e-18 151- 
195 BL01162A 15.38 1.265e-ll 
64-87 


1838 


BL00279 


Membrane attack complex components / 
perforin proteins. 


BL00279C 3L643.156e-09 134- 
188 


1838 


BL00059 


Zinc-containing alcohol dehydrogenases 
proteins. 


BL00059B 16.08 7.273e-09 93- 
121 


1841 


PD01066 


PROTEIN ZINC FINGER ZINC-FINGER 
METAL-BINDING NU. 


PD01066 19.43 5.415e-26 46-85 
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f^H-IO TVD17 7ThTf~* "VYKlf^VHy Ol/^MATTIDC 


tit) r\r\ r\ a OA i a £• Tiin 11 1 a(v9 

FKUUU4KA 1U.3z 3.74/e-ll 140- 
154 PR00048A 10,52 6.0876-101 

lit) -J. JO f| 


1841 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 4.130e-12 227- $ 
244 BL00028 16.07 7.577e- 11 \ 
352-369 BL00028 16.07 3.400e- l> 
10 380*397 BL00028 16.07 
6.400e-10 199-216 BL00028 
16.07 I.257e-09 143-160 

TIT Haato \C i\n o aoa„ An ni 
oJLUUUZK 10.0/ ^.Ozye-0y 171- 

188 BL00028 16.07 5.886e~09 


1841 


PR00967 


ACUTE MYELOID LEUKEMIA 1 
PROTEIN SIGNATURE 


PR009671 12.41 8.l30e-09 466- 
481 


1841 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PDQ0066 13.92 2.038e-10 187- 
200 PD00066 13.92 L600e-09 
396-409 PD00066 13.92 9. 400e- 
09 215-228 


1841 


BL00479 


Phorbol esters / diacylglycerol binding 
domain proteins. 


BL00479A 19.86 6.553e-09 214- 
237 BL00479A 1 9.86 9.809e-09 
367-390 


1842 


PD02910 


TRANSCRIPTION PROTEIN FACTOR 
REGULATION A. 


PD02910A 15.43 9.839e*09 62-97 


1843 


PD02199 


SUBUNIT HYDROGEN ION TRANSPORT 
T. 


PD02199A 20.58 1.000e-4O 10-61 
PD02199D 13.18 l.OOOe-40364- 
405 PD02199F 15.02 l.OOOe-40 
440-482 PD02199J 11.42 LOOOe- 
40 723-762 PD02199K 15.22 
1.000e-40 792-831 PD02199G 
9.43 4.447e-24 531-555 
PD02199B 27.90 1.474e-22 263- 
JUo ruvllyyri I j.o/ z.oJoe-21 
576-599 PD02199E7.56 8.642e- 
19 405-424 PD02199C 17.60 
8.085e-14 313-329 PD02199I 
8.90 4.780e-09 616-624 


1844 


BL002I8 


Amino acid permeases proteins. 


BL00218E 23.30 5.920e-10 343- 
383 


1845 


BL00048 


Protamine PI proteins. 


BL00048 6.39 9.526e-10 160-187 


1845 


BI ,00422 


Granins proteins. 


BL00422C 16.18 4.000e-09 590- 
618 


1845 


PR00833 


POLLEN ALLERGEN POA PI 
SIGNATURE 


PR00833H 2.30 8,385e-09 943- 
958 


1845 


PF00992 


Troponin. 


PF00992A 16.67 7.900e-13 568- 
603 PF00992A 16,67 4.090e- 11 
566-601 PF00992A 16.67 5.8 17e- 
W o /U-OUp rrlMJyyzA Io.o/ 

8.479e-10 579-614 PF00992A 
16.67 2.066e-09 564-599 
PF00992A 16.67 4.789e-09 575- 
610 PF00992A16.67 4.908e-09 
532-567 PF00992A 16.67 6,803e- 
09 536-571 PF00992A 16.67 
7.632e-09 562-597 PF00992A 
16.67 8.697e-09 585-620 
PF00992A 16.67 9,053e-09 583- 
618 PF00992A 16.67 9.289e-09 
516-551 
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1845 


PF01140 


Matrix protein (lsSA) > pi 5. 


PF01140D 15.54 8.500e-12 505- 
540 PF01140D 15.54 7.120c-ll 
584^619 PF01140D 15.54 9.760e- 
11 586-621 PF01140D 15.54 
3.813e-10 588-623 PF01140D 
15.54 4.938e-10 563-598 
PF01140D 15.54 6.738e-10 519- 
554 PFOU40D 15.54 8.31 3e-10 
503-538 PF0I140D 15.54 9.325e- 
10 549-584 PF01140D 15.54 
9.663e-10 567-602 PF01140D 
15.54 9.775e-10 565-600 
PF01140D 15.54 1.000e-09 582- 
617 PF01I40D 15.54 2,884e-09 
575-610 PF01140D 15.54 3 J98e- 
09 551-586 PF01140D 15.54 
3*302e-09 517-552 PF01 MOD 
15.54 4.l40e-09 533-568 
PF01140D 15.54 4.872e-09 578- 
613 PF01I40D 15.54 6.860e-09 
581-616 PF01140D 15.54 7.174e- 
09 594-629 PF0U40D 15.54 
7.384e-09 576-611 PF0I HOD 
15.54 7.593e-09 572-607 
PF01140D 15.54 8.640e-09 570- 
605 PF01140D 15.54 8.744e-09 
507-542 PF01140D 15.54 8.744e- 
09 596-63 1 PF0 1 1 40D 1 5,54 
9. 1 63e-09 577-6 12 PF011 40D 
15.54 9.26 /e-09 5/9-614 
PF01140D 15.54 9.791e-09 574- 
609 PF0H40D 15.54 1.000e-08 
531-566 


1848 


BLO0811 


Oicosins proteins. 


BL00811B 10,57 9,7916-09307- 
336 


1852 


BL00415 


Synapsins proteins. 


BL00415N 4.29 4.153e-09 301- 
345 


1852 


DM00668 


ZEIN. 


DM00668B 22,01 8.018e-09 291* 
343 


1853 


BL00107 


Protein kinases ATP-binding region proteins. 


BL00107A 18.39 3.250e-I7 133- 
164 






i YKUMlNb fvlNAbb 1 ALY IIC 
DOMAIN SIGNATURE 


PR00109E 14.41 3.045e-l 1 254- 
277 


1853 


PF00791 


Domain present in ZO-1 and Unc5-like netrin 
receptors. 


PF00791B 28.49 2.3 17e-16 689- 
744 PF00791B 28.49 3.753e-15 
523-578 PF00791B 28.49 4.3 16e- 

1.727e-ll 589-644 PF00791B 
28.49 3.636e-l 1 556-61 1 
PF00791C 20.98 4.913e-ll 570- 
609 PF00791B 28.49 6.330e-t0 
722-777 PF00791C 20.98 5.853e- 
09 703-742 


1853 


PF00023 


Ank repeat proteins. 


PF00023A 16.03 5.200e-13 722- 
738 PF00023B 14.20 1.000c- 12 
652-662 PF00023A 16.03 2,000e- 
12 755-771 PF00023A 16.03 
7.857e-Il 656-672 PF00023A 
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k 

! 






16.03 8.286G-11 622-638 
rrO0023B 14.20 4.6o2q-Q9 519- 
529 PF00023A 16.03 6.143e-09 
-)By-0U3 rfUUUzjA 16.U3 o.786e- 
09 689-705 


lfSDD 


priflfilY7SI 
1 IJUUU/o 


KJbrbA I rRU 1 tilJN JNULLbAK 

ANKYR. 


rUUUO/oB 1 J.14 y.OU0c-l 1 582- 
595 PD00078B 13.14 8.435e-09 

O'tj-QoZ 


1854 


PR00449 


TRANSFORMING PROTErN P2 1 RAS 
SIGNATURE 


PR00449A 13.20 4.971e-14 4-26 


1 8 jo 


UT rttfVl 1 *t 
rJLUOj 1 / 


WAP-type 'four-disttlfide core* domain 
proteins. 


BL00317B 14.58 3.550e~13 48-70 


1856 


PR00003 


4-DISULPHIDE CORE SIGNATURE 


PR00003C 7.69 6.357e-09 54-64 


1857 


BL01019 


ADP-ribosylation factors family proteins. 


BL010I9B 19.49 7.517e-21 95- 
150 


1857 


BL01020 


SARI family proteins. 


BL01020C 15.35 2.301e-18 79- 
130 


1857 


YS.VS. rt A Aj A A 

PR00328 


OTP-BINDING SARI PROTEIN 
SIGNATURE 


PR00328C 13.16 2.841e-10 78- 
104 


1858 


PD00066 


PROTEIN ZINC-FINGER METAL-BIN DL 


PD00066 13.92 2.385e-l5 128- 
141 PDO0O66 13.92 5.714e~12 
100-113 


1858 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048B 6.02 7,188e-10 97-107 
PR00048B 6.02 3.842e-09 125- 
135 PR00048A 10.52 6,040e- 09 
137-151 


1859 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 1.000e-U 174- 
188 PR00048B 6.02 1.692e~l I 
162-172 


1859 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDI. 


PD00066 13.92 3.739e-l 1165- 
178 


1859 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL0G028 16.07 2.385e-U 177- 
194 BL00028 16.07 3J69e-l 1 
121-138 BL00028 16.07 8.269e- 
11 149-166 


1860 


P£00651 


BTB (also known as BR-C/Ttk) domain 
proteins. 


PF00651 15.00 2.895e-l 1 45-58 


1860 


PD00066 


PROTEIN ZINC-FINGER METAL-BINDL 


PD00066 13.92 2.03$e-10 419- 
432 PD00066 13.92 7.231e-10 
391-404 


I860 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 5.696e-I0 400- 
414 PR00048A 10.52 8.435e-10 

A*}Q AA^i 

428-442 


1860 


BL00028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 8.269e-ll 403- 
420 BL00028 16.07 3.400e-10 

J f J Jjl SDLtUUuZtO IO.U/ j.UJ/G- 

09 431-448 


1861 

* 


DM00547 


1 kw CHROMO BROMODOMAIN 
SHADOW GLOBAL. 


DM00547F 23.43 7.643e-34 606- 
653 DM00547B 1 1.28 7.907e-16 
155-169 DM00547C 17.30 
8.650e-14 209-231 DM00547D 
11.60 6.500e-l 3 277-291 
DM00547E 13.94 L000e-11 307- 
330 


1861 


BL00039 


DEAD-box subfamily ATP-dependent 
helicases proteins. 


BL00039D 21.67 6.379e-10 590- 
636 


1862 


BL01115 


GTP-bindin§ nuclear protein ran proteins. 


BL01115A 10.22 5.330e-U 18-62 
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1862 


BL01019 


ADP-ribosylation factors family proteins. 


BL010I9A 13.20 I.809e-09*52-92 


I502 


nn AA,1 A c\ 

PR00449 


TRANSbORM ING PROTEIN P21 RAS 
SIGNATURE 


PR00449C 17.27 3.647e-19S9-82 
PR00449A 13.20 7.000e-15 §8-40 
PR00449D 10.79 8.875e-14 121- 
135 PR00449E 13.50 S.920e|l4 
157-180 PR00449B 1434 8.500e- 
09 41-58 


IdO/ 


JJLOlzeJ 


T-box domain proteins. 


BL01283D 11.70 7.868e-31 59-92 
BL01283C 13.05 2.537e-14 25-39 


1867 


PR00937 


T-BOX DOMAIN SIGNATURE 


PR00937D 13.41 5.378e-15 24-39 
PR00937F 12.53 l.450e=l2 83-92 
PR00937E 1 1.86 5.592e-12 62-76 
PR00937C 10.51 5.219e-10 5-15 


1870 


DMU 1 803 


1 HERPESVIRUS GLYCOPROTEIN H. 


DM01803A 10.51 8.699e-09 100- 
121 


1872 


BL00470 


Isocitrate and isopropylmalate 
dehydrogenases proteins. 


BL00470A 16.25 5.179e-14 10-31 
BL00470C 15.43 4.103e-10 223- 
238 BL00470E 16.52 1.900e-09 
287-297 


1873 


PF00023 


Ank repeat proteins* 


PF00023A 16.03 3.893e-09 44-60 
PF00023B 14,20 9.182e-09 40-50 


1874 


PFOO023 


Ank repeat proteins. 


PF00023A 16.03 3.893e~09 72-88 
PF00023B 14.20 9.182e-09 68-78 


1877 


PD01066 


PROTEIN ZINC FINGER ZINC-FINGER 
METAL-BINDING NU. 


PD01066 19.43 2.019e-26 51-90 


1877 


BLOO028 


Zinc finger, C2H2 type, domain proteins. 


BL00028 16.07 6.786e-16 427- 
444 BL00028 16.07 1.900e»13 
287-304 BL00028 16.07 3.700e- 
13 481-498 BL00028 16.07 
1.000e-12 315-332 BL00028 
16.07 1.000e-12 399-416 
BL00028 16.07 3.348e-12 453- 
470 BL00028 16,07 4.522e~12 
371-388 BL00028 16.07 6.885e- 
11 343-360 BL00028 16,07 
4.600e-l 0 509-526 


1877 


PR00048 


C2H2-TYPE ZINC FINGER SIGNATURE 


PR00048A 10.52 8.500e-l7 424~ 
438 PR00048A 10.52 7.000e-14 
312-326 PR00048A 10.52 1.643e- 
13 396-410 PR00O48A 10.52 
3.57le-13 478-492 PR00048B 
6.02 9.000e-12 300-310 
PR00048A 10,52 1.000e-ll 506- 
520 PR00048A 10.52 5,737c- 11 
340-354 PR00048A 10.52 1.39le- 
10284-298 PR00048B6.02 

£. {\6'2a i ft A 1 0 A O'* T>t> AAA/1 Ot> 

6.02 1.474e-09 494-504 
PR00048B 6.02 2.895e-09 356- 
366 


1877 


PD00066 


PROTEIN ZINC-FINGER METAL-B1NDI. 


PD00066 13.92 4.600e-14 415- * 
428 PD00066 13.92 7.000c- 14 
469-482 PD00066 13.92 5.500s- 
13 303 316 PD00066 13.92 
4.429e-l 2 331-344 PD00Q66 
13.92 9.217e-1 1 497-510 
PD00066 13.92 2.038e-10 387- 
400 PD00066 13,92 6.400e-09 
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359-372 


1878 


PR0083O 


ENDOPEPTIDASE LA (LON) SERINE 
PROTEASE (SI 6) SIGNATURE 


PR00830A 8,41 4.927e-13 222- 
242 


1878 


PR00300 


ATP-DEPENDENT CLP PROTEASE ATP- 
BINDING SUBUNIT SIGNATURE 


PRQ0300A9.56I.545e-ll 218- 

237 


1878 


PR00051 


BACTERIAL CHROMOSOMAL 
REPLICATION INITIATOR (DNAA) 
SIGNATURE 


PR00051A 10,68 3.647e-10 215- 
236 


1878 


PR00819 


CBXX/CFQX SUPERFAMILY 
SIGNATURE 


PR0O819B 10.83 9.213e-10 217- 
233 


1878 


BL00674 


AAA«protein family proteins. 


BL00674B 4.46 9.000e-24 215- 
237 BL00674C 22.60 8.448e-20 
248-291 BL00674D 23.41 
5.140e-18 308-355 BL00674E 
15.24 9,2!7e-16 390-410 
BL00674A 16.91 5.304e-O9 181- 
202 


1878 


BL00113 


Adenylate kinase proteins. 


BL001I3A 12.74 8.615e-09 219- 
236 


1879 


BL00456 


Sodium :solute symporter family proteins. 


BL00456A 22.59 l,957e-32 35-90 
BL00456B 18.94 9.780e-17 111- 
141 


1880 


BL00456 


Sodium :solute symporter family proteins. 


BL00456A 22.59 1.957e-32 35-90 
BL00456C 24.55 L225e-31 173- 
228 BL00456B 18.94 9/780e-I7 
111-141 


1884 


BL00290 


Immunoglobulins and major 
histocompatibility complex proteins. 


BL00290A 20.89 l,818e-ll 159- 
182 


1885 


PR00830 


ENDOPEPTIDASE LA (LON) SERINE 
PROTEASE (SI 6) SIGNATURE 


PR0O830A 8.41 5.897e-I0 352- 
372 


lotto 


r> r A A O A?J 

BL00847 


MCM family proteins. 


BL00847D 15.16 8.568e-25 343- 
384 BL00847B 24.76 8.971e-25 
194-237 BL00847C 18.79 9.270e- 
10 3O1-33S 


looD 


rKUU44V 


IKANbrUKMlNCjFROIEIN P21 RAS 
SIGNATURE 


PR00449A 13.20 1.93le-09 346- 
368 


1 OOJ 




AAA- protein family proteins. 


BL00674B 4.46 5.378e-09 345- 
367 


1 oot> 




diu (also Known as oK-wittc; domain 
proteins. 


tjt?AACci i c Af\ i i\r\f\ t a £?f\ 

rrUUtol 1D.00 /.OUOe- 10 50-63 


1 OOO 




T<T17T PH PRPPAT QTfVM A TT TD C 

lSJiLi^>rl xvorliAl blwIN/V I UKfcv 


DDnncrti A O C one* nn cat 

mUU5UlA 8,25 5.295e-09 507- 
521 


1886 




DIHYDROPTERIDINE. 


um\}\)\jyy]5 14. 15 o.o25e-0y 479- 

489 


1887 




RE. 


rUULUUoA lU.zo 6*o25e-l3 544- 
558 


1887 


PD01364 


MUCIN GLYCOPROTEIN PRECURSOR 
MEM. 


PD01364B 13.94 5.500e-09 544- 
560 


1888 


PD00306 


PROTEIN GLYCOPROTEIN PRECURSOR 
RE. 


PD00306A 10.26 6,625e-I3 544- 
558 


1888 


PD01364 


MUCIN GLYCOPROTEIN PRECURSOR 
MEM. 


PD01364B 13.94 5.500e-09 544- 
560 


1889 


PD00306 


PROTEIN GLYCOPROTEIN PRECURSOR 
RE. 


PD00306A 10.26 6.625e-13 544- 
558 


1889 


PD01364 


MUCIN GLYCOPROTEIN PRECURSOR 
MEM. 


PD01364B 13,94 5.500e-09 544- 
560 


1890 


PF00938 


Lipoprotein. 


PF00938E 19.50 6.096e-09 272- 



320 



WO 02/070539 



PCT/US02/0509S 



SEQID 
NO: 


Database 
entry ID 


Description 


Results* 






— — ™j 


307 


1891 


PF00925 


GTP cyclohydrolase II. | 


PF00925F 13.23 9.850e^09 356- 
367 


1893 


BL00226 


Intermediate filaments proteins. f 
„ «\ — 


BL00226A 1177 5.355e-13 139- 
154 


1895 


PF00035 


Double-stranded RNA binding motif. 


PF00035B 12.08 7.750e-09 273- 
287 


1896 


FF00622 


Domain in SPla and the RYanodine Receptor. 


PF00622B 21,00 9.250e-ll 170- 
192 



TABLE 4 



Cjm\j\I JUL? 




T^oc/>vJ nfinii 




Score 


Repents 


Position 


950 


UCH-2 


TTnlfltiltltl Pfll'nrtV vl—lf^irn infil KivrlrrilticM 
U UllJLIiUlL WuUvAjrl LGJHIilltu lljrUlUItiae 


f\ fa-Oik 












family 










950 


UCH-1 


T Ihinuitfn rnrhfivvl-fAfmirifil Vivrlrnlac^c 

\-/UlUUlLIll VCU UUA Y 1 kwllll jiiai iiyui VrinovS 




fid A. 


— 


/oo-zy / 






famil 










952 


efhand 


EF hand 


0.0034 


24.0 


-: 




952 


Adeno E 


Early EI A protein 


8.9 


-1683 









1A 












953 


SH2 


SH2 domain 


1.5e-16 


68.4 


_ ■ 


320-396 


954 


SH2 


SH2 domain 


1.5e-16 


68.4 


; 


347-423 


955 


RCC1 


Regulator of chromosome condensation 


1.3e-13 


58.6 


4 








(RCC1) 






















249:318" 














367:370- 














418 


958 


BRCT 


BRCA1 C Tcixninus (BRCT) domain 


7.2 


1.2 


1 


45-130 


962 


UQ con 


Ubiquitin-conjugating enzyme 


5.4e-39 


143.0 


1 


2-117 


963 


UQjJon 


Ubiquitin-conjugating enzyme 


L6e-57 


204.5 


1 


2-132 


971 


kinesin 


ICinesin motor domain 


2.2e- 


526.3 


1 


47-372 








154 








971 


WD40 


WD domain, G-beta repeat 


le-53 


191.9 


7 


1327- 














1361:1367- 














1402:1432- 














1466:1472- 














1511:1523- 














1557:1564- 














1600:1606- 














1640 


971 


filament 


Intermediate filament protein 


2.6 


-195.6 


1 


423-805 


971 


HDV ag 


Hepatitis delta virus delta antigen 


6.2 


-47.5 


1 


703-880 


971 


PFEMP 


Plasmodium falciparum erythrocyte 


8.9 


-86.8 


1 


475-585 






membrane p 










971 


G6PD 


GIucose»6-phosphate dehydrogenase, 


9.6 


-123.8 


1 


912*1049 






NAD bindi 










971 


DUF232 


Putative transcriptional regulator 


9.7 


-30.1 


1 


616-750 


975 


Kelch 


Kelch motif 


5.7e-62* 


219.3 


4 


267- 














312:314- 














359:361- 














406:408- 














453 


975 


BTB 


BTB/POZ domain 


3.1e-38 


140.4 


1 


23-130 


977 


kinesin 


Kinesin motor domain 


9.7e- 


520.8 


1 


53^353 
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E-valuc 


Score 


Repeats 


Position 








153 








977 


FHA 


FHA domain 


O.T>78 


16.9 





470-534 


977 


KMP11 


Kinetoplastid membrane protein 1 1 




-3.2 




355-438 


977 


SART-1 


SART-t family 


^| 


-360.1 


j 


325-881 


977 


Transposa 
se 22 


LI transposable element 




-180.7 


1 


322-622 


979 


Ribosoma 
1 L10 


Ribosomal protein LIO 




-33.3 




70-173 


980 


Aa trans 


Transmembrane amino acid transporter 


1.6e-75 


264.3 




69-479 


980 


oxfdored_ 
al 




NADH-Ubiquinone/plastoquinone 


7.6 


-169.3 


1 


63-326 


980 


xanjirjse 

mi ease 


Permease family 


8.4 


-201.3 




137-471 




Trn Tvr 

perm 


THrvntYinVifln/twrt^inp nprni(*M<:p fiittiHv 


9.6 


-297.9 




7ft A1A 


982 


HYR 


HYR domain 


2e-35 


13L1 


2 


106- 

I O / . 1 OO" 

267 


982 


EGF 


EGF-Iikc domain 


3.9e-24 


93.6 


3 


742- 
77V7R0- 
8U;818- 
849 


982 


sushi 


Sushi domain (SCR repeat) 


0.28 


12.5 


3 


1-38:43- 
104:272- 
332 


982 


TNFR c6 


TNFR/NGFR cysteine-rich region 


0.74 


9.6 




563-601 


982 


lam in in E 
GF ~ 


Laminfn EGF-Iike (Domains III and V) 


6.6 


-12.8 




746-79 1 


982 


metalthio 


Metal lothi one in 


7.6 


-11.6 




744-804 


982 


HMG_Co 
A synt 


Hydroxymethylglutaryi-coenzyme A 
synthas 


8.7 


-342.7 


1 


336-625 


986 


MHCJL 
alpha 


Class 11 histocompatibility antigen, alp 


8.8e-13 


55.9 


1 


37-106 


986 


. &._ 


Immunoglobulin domain 


8e-05 


29.5 




122-186 


987 


LRR 


Leucine Rich Repeat 


2.6e-12 


54.3 


4 


68-91:92- 
114:115- 
137:138- 
159 


987 


UVR 


UvrB/uvrC motif 


5.3 


-L5 


1 


453-486 


990 


UVR 


UvrB/uvrC motif 


3.8 


-0.3 


1 


552-588 


991 


CK_II_be 
ta 


Casein kinase 11 regulatory subunit 


4.5e-69 


242.9 


1 


5424 


994 


RNAj)ol 
B 


RNA polymerase beta subunit 


0 


1199.4 


1 


26-1010 


994 


PHD 


PHD-fmger 


5.9 


-17.0 


1 


1013-1048 


995 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


4.9e-05 


30.2 


1 


10-59 


995 


zf-B box 


B-box zinc finger 


0.0022 


24.7 


1 


92-134 


996 


HLH 


Helix-loop-helix DNA-binding domain 


2e-07 


38.1 


1 


276-327 


997 


ras 


Ras family 


3,3e-12 


2.0 


1 


23-145 


998 


pkinase 


Protein kinase domain 


3.1e-08 


-164 


I 


1*139 


1000 


ig 


Immunoglobulin domain 


2.7e-06 

* 


34.4 


2 


42-95:225* 
281 


looi 


Yj>hosph 
atase 


Protein-tyrosine phosphatase 


4.2 


-86.2 


1 


180-409 


1002 


PX 


PX domain 


5.8e-19 


76.4 


1 


22-138 


1002 


Peptidase 
S21 


Assembiin (Peptidase family S21) 


8,9 


-173.6 


1 


76-352 


1003 


Y^phosph 


Protein-tyrosine phosphatase 


LI 


^79.3 


1 


98-327 
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Repeats 
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atase 












1004 


Y_phosph 
atase 


Protein-tyrosirle phosphatase 


4.2 


-86.2 


1 


180-409 


1008 


oxidored 


NADH-Ubiquaone/plastoquinone 
(complex I) B 


5,2 


-165.9 


f 


64-311 


1010 


filament 


Intermediate fifiment protein 


1.7 


-190.5 


j 


331-647 


1010 


Tektin 


Tektin family w 


1.8 


-228.5 




192-507 


1010 


bZIP 


bZIP transcription factor 


4.4 


-3.7 




253-317 


1010 


spectrin 


Spectrin repeat 


5,6 


-18.0 


1 


320-429 


1010 


SART-1 


SART-1 family 


8 


-362.0 




54-675 


1010 


Myosin t 
ail 


Myosin tail 


9.8 


-555.1 




6-734 


1013 


Defensin_ 
P ro P e P 


Defensin propeptide 


1.2e-26 


102.0 




45-97 


1013 


defensins 


Mammalian defensin 


7.3e-I4 


59.5 




110-138 


1014 


filament 


Intermediate filament protein 


0.69 


-180.5 




314-579 


1014 


PolyA_po 
1 


Poly A polymerase family 


2.5 


-64.5 




348-463 


1014 


ERM 


Ezrin/radixin/moesin family 


8.7 


-223.3 




226-487 


1014 


Transposa 
se 12 


Transposase 


9.4 


-152.0 




155465 


1015 


zf-C2H2 


Zinc finger, C2H2 type 


1.2e-55 


198.2 


13 


129- 

152:349- 
371:379- 
401:407- 
429:446- 
468:474- 
496:505- 
527:533- 

925:931- 
953:959- 
981:987- 
1010 


1015 


60sjribos 
omal 


60s Acidic ribosomal protein 


0.23 


-21.0 


1 


61-194 


1015 


TFIIS 


Transcription factor S-II (TFIIS) 


0.82 


2,1 


1 


446-484 


1015 


rubredoxi 
n 


Rubredoxin 


2.8 


-8.3 






1015 


zf-BED 


BED zinc finger 


9 


-7.0 


1 


972-1011 


1021 


SSF 


Sodium :solute symporter family 


L7e-05 


-65.8 


1 


5-184 


1028 


2f-C2H2 


Zinc finger, C2H2 type 


4.3e-30 


113.4 


5 


100- 

122:132- 
154:160- 
182:188- 
210:216- 
238 


1028 


KRAB 


KRAB box 


7.1e-24 


92.8 


1 


4-44 


1028 


zf-BED 


BED zinc finger 


0.63 


3.4 


I 


78-123 


1029 


fn3 


Fibronectin type III domain 


1.8e-13 


58.2 


1 


146-231 


1029 


H 


Immunoglobulin doiftain 


0.0013 


25.5 


1 


275-335 


1031 


polypreny 
1 _synt 


Polyprenyl synthetase 


0.013 


=82.6 


1 


119-332 


1032 


Defensin^ 
propep 


Defensin propeptide 


1.2e-26 


102.0 


1 


68-120 


1032 


defcnsins 


Mammalian defensin 


7.3e-14 


59.5 


1 


133-161 


1033 


MAGE 


MAGE family 


3.8e-34 


126.8 


1 


1-208 
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lVf nrlpl 
1T1UUCI 


JJcaCripiIOIl 


lii-vmue 


Score 


Repeats 


Position 




























\y? 1 .Oy/- 














71 0-7<1 














/ / J. /ol- 














OV/j.Ov-'' 














R31 -R37- 

I . OJ / 














859:865- 














8R7'R93- 














915 


1084 


CBM 1 


Fungal cellulose binding domain 


2.6 


4.4 


1 


38-66 


1084 


LIM 


1 JlVf domain 


3 


-15.7 


l 


7R3-R47 


1084 


zf-BED 


KJkJJL-' £illlv llllgWl 


5,9 


-5.3 


9 


?ft^ 














7Ad.-7'?7 














77 £ 






QP17L.1 Anmaift / Rand 7 familvr 

orrn uomain/ Dana / iamny 


ft Ip /L9 


1 19 1 
13£.J 


1 
1 


1Q 91/1 






1 I K. UQllltlin 


A Cq I JT 


00*0 


I 


58-91:92- 














125 




Will 


wtll domain 


O.0U17 


11.0 


1 


11-119 


1091 


zf-DHHC 


DHHC zinc finger domain 


0.033 


-11.5 


I 


120-158 


1094 


Calx-beta 


Calx-beta domain 


0.19 


-11.6 


1 


23-117 


1095 


zf-C2H2 


Zinc finger, C2H2 type 


5.4e-82 


285.8 


12 


288- 














311:337- 














359:365- 














387:393- 














415:421- , 














443:449- 














47 1 :477- 










■ 




499:505- 














527:533- 














JJJ.JOl" 




























01 1 .01 /- 
















1095 


SCAN 


SCAN domain 


1 .5e-54 


194.6 


) 




1095 


zf-BED 


RPT*i wine fino/*r 




-3 0 


7 






























612 


1097 


7tm 2 


t W ClllpLllulllL/l tHIV-f 1 lA>V^|s IV/I yK?V> v"l V III! 


6.8e-21 


82.8 


1 
i 








family) 










1097 


GPS 


Latrophilin/CL-t-like GPS domain 


9.5e-13 


55.8 


1 




1097 


01 & 


C* flprrftn*? 5?T0 fa mil v inlpDral 
vj,dci^aii>3 ui^ itiiiiii^ uniaxial 


4.5 


-217.5 


1 

i 


3ftQ-SfiS 






membrane nrote 










1099 


lectin c 


Lectin C-tvne domain 


0.00 11 


7.2 


| 


6-100 


1100 


PDZ 


PT*>7 riom flirt fAl«;n known a<i DHR ro* 


ft flOtd 


3 


1 
i 








GLGF) 










1100 




Tvmrwirim jdS/7iJK ft tirrifrMti 


1 R 




1 
l 






kd 70kd 












1101 


cadherin 


Cadherin domain 


8.9e-95 


328.3 


5 


64- 














156:170- 














265:279- 














381:394- 














485:498- 














595 


1101 


Cadherln_ 


Cadherin cytoplasmic region 


4.7C-80 


279.4 


1 


643-794 




C term 












1103 


COesteras 
e 


CarboxylesterasG 


0.98 


-265.9 


1 


31-265 


1104 


DSPc 


Dual specificity phosphatase, catalytic 


9.7e-30 


112.2 


1 


133-315 
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doma 










1105 


efhand 


EF hand 


3.1e-21 


84.0 


\ 


124- 

152:160- 
188:208- 


1108 


zf-C2H2 


Zinc finger, C2H2 type 


3.5e-68 


239.9 


10 * 
• 


1 o/ 

212:240- 

262:268^ 

347:353- 
375:382- 
404:909- 
931:937- 
960:966- 
988 


no8 


SET 


SET domain 


0.0012 


-18.9 


1 


37-175 


1108 


zf-BED 


BED zinc finger 


0.1 


10.4 


2 


276- 

320:922- 

961 


1108 


FYVE 


FYVE zinc finger 

"III ■ ■— - : E ■ ■■ ■ 


6.8 


-24.6 


I 


262-364 


1109 


Nucleosid 
e tra2 


Na+ dependent nucleoside transporter 


2.5e- 
187 


635.7 


I 


198-613 


1109 


TLC 


TLC ATP/ADP transporter 


5.6 


-382.0 




95-407 


1109 


ATP- 
synt A 


ATP synthase A chain 


6.5 


-79.5 


> 


365-503 


1110 


PHD 


PHD-finger 


5,2 


-16.5 




77-120 


1113 


Peptidase 
C48 


Ulpl protease family, C-terminal cataly 


6.5e-34 


126.1 




254-415 


1114 


ras 


Ras family 


5.5e-3 1 


116.4 




54-222 


1114 


arf 


ADP-ribosylation factor family 


0.0054 


-80.3 


1 


37-213 


1115 


SPRY 


SPRY domain 


7.3e-10 


46.2 


Hj 1 


281-419 


1115 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


0.00076 


26.2 




88-134 


1116 


LRR 


Leucine Rich Repeat 


2.1e*05 


31.4 


6 


37-64:65- 

92:93- 

120:121- 

142:175- 

198:255- 

278 


1117 


RasGAP 


GTPase-activator protein for Ras-Hke 
GTPase 


6.4e-28 


106.2 


1 


268-440 


1117 


PH 


PH domain 


0.28 


15.7 




7-78 


1117 


C2 


C2 domain 


1.9 


-9.5 


J 


91-171 


1117 


bZIP 


bZIP transcription factor 


9 


-6.7 


1 


997-1054 


1118 


COX3 


Cytochrome c oxidase subunit HI 


L7 


-228.9 


1 


77-195 


1118 


sugar_tr 


Sugar (and other) transporter 


2.2 


-179.3 


I 


32-413 


1120 


LMWPc 


Low molecular weight phosphotyrosine 
protein 


9.7G-56 


198.6 


1 


7-138 


1122 


M 


M protein repeat 


6.6 


13.1 


2 


148- 

168:216- 

236 


1123 


lectin c: 


Lectin C-type domain 


4.3e-ll 


50.3 


1 • 


579-646 


1123 


Tropomyo 
sin 


Tropomyosin 


0.17 


-90.1 


1 


304-500 


1123 


filament 


Intermediate filament protein 


0.17 


-164.6 


1 


287-537 


1123 


spectrin 


Spectrin repeat 


5.8 


-18.2 


1 


422-523 


1127 


vwa 


von Wiilebrand factor type A domain 


4Je-63 


223.1 


1 


252-450 


1127 


trypsin 


Trypsin 


2.1e-43 


157.6 


1 


463-734 
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SEQ ID 
1127 


Model 


Description 


E-valtie 


Score 


Repeats 


Position 


sushi 


Sushi domain (SCR repeat) j 


( 2.4e-24 


94.3 


3 


37-85:86- 

140:147- 

200 


1128 


Neur_cha 
n memb 


Neurotransmitter-gated ion-channel tra 


k Ie-88 


308.2 


1 


84-334 


1128 


oxidored 
ql N 


NADH-Ubiquinone oxidoreductase 


D 


-14.7 


1 


123-183 


1129 


C4 


C-terminal tandem repeated domain in 
type 4 


1.4e- 
148 


507.0 


2 


477- 
584-5X5 

699 


1129 


Collagen 


Collagen triple helix repeat (20 copies) 


1.9e-60 


214.3 


7 


20-78:84- 
142:143- 
202:205- 
265:266- 

37 V39Q 

388:405- 
464 


1132 


filament 


Intermediate filament protein 


2.1 


-193.2 


1 


90-330 


1132 


Tropomyo 
sin 


Tropomyosin 


7.7 


-119.0 


1 


151-353 


1135 


zf-C2H2 


Zinc finger, C2H2 type 


8,3e-26 


99.2 


5 


278- 

339:345- 
369:375- 

429 


1136 


Ribosoma 
[ S2 


Ribosoma! protein S2 


1.7c-78 


274.2 


. 


34-198 


1137 


ATP- 
synt ab 
N 


ATP synthase alpha/beta family, beta- 
ba 


5.2e-26 


99.8 




63-129 


1139 


ATP- 
synt__ab 


ATP synthase alpha/beta family, 
nucleot 


3.1e-06 


-33.1 




71-183 


1139 


ATP- 
synt ab 
N 


ATP synthase alpha/beta family, beta- 
ba 


0,015 


14.4 




10-68 


1140 


ATP- 
synt ab 
N 


ATP synthase alpha/bota family, beta- 
ba 


5.2e-26 


99.8 




63-129 


1140 


ATP- 
synt ab 


ATP synthase alpha/beta family, 
nucleot 


2.6e-06 


-31.8 


1 — 


132-261 


1141 


ank 


Ankyrin repeat 


6.7e-34 


126.0 


3 


463- 
495:496- 
528:529- 
561 


1141 


BRCT 


BRCA1 C Terminus (BRCT) domain 


L5e-15 


65.1 


2 


578- 

689:705- 

812 


1141 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


0.083 


9.7 




86-122 


1143 


GTP EFT 
U 


Elongation factor Tu GTP binding 
domain 


0.031 


-59.5 




68-140 


1145 


RF-1 


Peptidyl-tRNA hydrolase domain 


3.3e«05 


10.5 




46-159 


1146 


RIM 


Peptidyl-tRNA hydrolase domain 


6.4 


-51.5 




46-114 


1148 


WD40 


WD domain, G-beta repeat 


6.9e-07 


36.3 




44^80 


1149 


Band 41 


FERM domain (Band 4.1 family) 


l.le-77 


271.5 




45-235 


1150 


Metallothi 
o PEC 


Plant PEC family metallothionein 


5 


-38.3 




90-139 


1153 


pkinase 


Protein kinase domain 


1.7e-90 


314.1 


1 


43-299 
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1155 


SCP 


SCP-like extracellular protein 


6.2e-14 


56.1 


1 


28-200 


1156 


ras 


Ras family * 


1.5e-15 


40.0 


1 


5-98 


1159 


6PF2K 


6=phosphofructo-2-kinase 


5.2c- 
152 


518.4 


1 


26-249 


1159 


PGAM 


Phosphoglycerft mutase family 


6e-100 


345.5 


1 


250-435 


1160 


zf-C2H2 


Zinc finger, C2H2 type 


1.3e- 
104 


361.0 


13 


223- 

245:251- 
273:279- 
301:307- 
329:335- 
357:363- 
385:391- 
413:419- 
441:447- 
469:475- 
497:501- 
523:529- 
551:557- 
579 


1160 


PHD 


PHD-finger 


0.55 


-7.5 


1 


448-509 


1160 


zf-BED 


BED zinc finger 


14 


0.3 


2 


264- 

302:542- 

580 


1160 


DnaJ CX 

XCXGX 

G 


DnaJ central domain (4 repeats) 


2.9 


-45.3 


I 


509-572 


1160 


L1M 


JL1M domain 


u 


-19.2 


I 


477-539 


1160 


TFIIS 


Transcription factor S-II (TFIIS) 


7.9 


-6.3 


1 


450-485 


1162 


Patatin 


Patatin-Iike phosp ho lipase 


0.00033 


-0.6 


1 


1-171 


1163 


pkinase 


Protein kinase domain 


2.3e-94 


326.9 


I 


53-303 


1163 


RIOl 


RJO1/ZK632.3/MJ0444 family 


0.37 


-100.2 


I 


47-245 


1164 


Oxysterol 
BP 


Oxysterol-binding protein 


3.8e-47 


170.0 


1 


173-571 


1166 


OATP_C 


Organic Anion Transporter Polypeptide 
(OATP) 


9.9e- 
195 


660.3 


1 


68-443 


1166 


OATP_N 


Organic Anion Transporter Polypeptide 
(OATP) 


le-67 


238.4 


1 


520-680 


1166 


7tm 5 


7TM chemoreccptor 


6.1 


-167.4 


1 


184-503 


1166 


sugarjr 


Sugar (and other) transporter 


7.8 


-195.1 


1 


48-570 


1167 


pentaxin 


Pentaxin family 


2.3e-07 


-7.3 


1 


25-98 


1168 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


L7e-05 


31.7 


1 


537-574 


1169 


Peptidase 
Ml 


Peptidase family Ml 


4e-l 56 


532.1 


1 


69-458 


1170 


!8 ...... 


Immunoglobulin domain 


0.0016 


25.1 


I 


30-109 


1172 


BTB 


BTB/POZ domain 


8.7e-33 


122.4 


1 


104-214 


1173 


F-box 


F-box domain 


0.0019 


24.9 


1 


16-64 


1174 


TPR 


TPR Domain 


45 


9.3 


1 


301-334 


1175 


20G- 
FelI_Oxy 


20G-Fe(H) oxygenase superfamily 


1.6e-06 


35.1 


1 


527-648 


1175 


TPR 


TPR Domain 


45 


9,3 


1 


30W34 


1176 


20G~ 
Fell Oxy 


20G-Fe(II) oxygenase superfamily 
* 


L6e-06 


35.1 


1 


557-678 


1177 


NaJZaJS 

X 


Sodium/calcium exchanger protein 


l.le-23 


92.1 


1 


236-381 


1179 


lactamase 
B 


Metal lo-beta-Iactamase superfamily 


0.059 


-4,5 


1 


13-212 


1180 


fibrinogen 
C 


Fibrinogen beta and gamma chains, C- 
tcrm 


1.6e-32 


121.5 


1 


207-416 
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1181 




Macrophage migration inhibitory factor 
(MIF) 


7.6e«67 


235.5 




2-115 


1182 


SSE4 


Sodium rsolute sym porter family 


I.7e« 

234 


792,4 


1 


69-503 


1182 


xanfiir_pe 
rmefce 


Permease family 


4.3 


-193.8 






1182 


PNTO 


NAD(P) ttanshydrogenase beta subunit 


5 


-389.7 




33-325 


U82 


60KDJM 
P 


60Kd inner membrane protein 


6.2 


-129.6 




113-241 


1182 


oxidored 
ql 


NADH-Ubiquinone/plastoqui none 


7.4 


-169.1 


• 


87-298 


1183 


Cation_ef 
flux 


Cation efflux family 


2.3e-58 


207.3 


i 


73-311 


1185 


AAA 


ATPase family associated with various 
eel 


4e-89 


309.5 


7 


4.71 .qfift- 

620 


1185 


HypB Ur 
eG 


MvDft/tJreG nucleotide-bindinf? domain 


3.2 


-65.8 






1185 


ArsA_AT 
Pase 


Ani on-transporting ATPase 


8.5 


-195.7 


j 




1186 


HC03_co 
transp 


HC03- transporter family 


o 


1389.6 




141 -tfin 


1186 


xan_ur_pe 
rmease 


Permease family 


0.33 


-164.5 


» 


518-985 


1187 


homeobox 


Homeobox domain 


2.4e-16 


67.7 




51-107 


1188 


efhand 


EF hand 


6.7 


9.0 


J 


13-41 


1191 


GST_C 


Glutathione S-transfcrase, (^terminal 
domain 


0,93 


0.3 






1194 


PPR 


PPR repeat 


0.0019 


24.9 


j 


14-48 


1195 


thiored 


Thioredoxin 


0.018 


-13.7 


1 


390-497 


1197 


ENVjjoI 
yprotein 


ENV pol yprotein (coat pol yprotein) 


1.2e-08 


-24 J 


j 


86-529 


1200 


UQ_con 


Ubiquitin-conjugating enzyme 


1.3e-23 


91.9 


J 


60-190 


1202 


7tmJ 


7 transmembrane receptor (rhodopsin 
family) 


2.1e-23 


91.2 




59-306 


1202 


7tm 5 


7TM chemoreceptor 


4.3 


-164.7 


-j— 


37-314 


1203 


7tmJ 


7 transmembrane receptor (rhodopsin 
family) 


5.9e-37 


136.2 


j 


59-341 


1203 


7tm 5 


7TM chemoreceptor 


2.2 


-159.4 


J 


37-338 


1204 


SH3 


SH3 domain 


2.5e-05 


31.1 




257-317 


1204 


UBA 


UBA/TS-N domain 


0,00013 


28.8 




36-76 


1204 


PGAM 


Phosphoglycerate mutase family 


0.00044 


-75.5 


; 


438-625 


1205 


heme 1 


Heme/Steroid binding domain 


0.00053 


19.5 


L — 


37-112 


1207 


transmem 
brane4 


Tetraspanin family 


0.29 


-69.8 




11-110 


1208 


OATP_C 


Organic Anion Transporter Polypeptide 


1.3e- 
135 


464.0 


j 


148-524 


1208 


kazal 


Kazal-type serine protease inhibitor d 


0.11 


4.9 




555-601 


1208 


sugarjr 


Sugar (and other) transporter 


0.15 


-145.7 




128-626 


1208 


Hg_chan 


Ligand-gated ion channel 


3.3 


-153.6 




193-524 


1208 


7tmJ ^ 


7 transmembrane receptor 


4,2 


-115.5 




177-473 


1208 


Cytidylylt 
rans 


Phosphatidate cytidylyitransferase 


4.6 


-87.5 




177-268 


1215 


tliyroglob 
ulinj 


Thyroglobulin type-1 repeat 


3,2e-35 


130.4 


2 


90- 

153:216^ 
281 


1215 


kazal 


Kazal-type serine protease inhibitor 


7.5e-09 


42,8 


1 


40-84 


1215 


efhand | EFhand 


0,057 


20.0 


2 


351- 
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379:388- 
416 


1216 


annex in 


Annexin 


7e-61 


215.7 


2 


72- 

139:144- 
211 


1217 


annex! n 


Annexin 


2.2e-86 


300.4 


3 


44- 

111:116- 
183:199- 
267 


1218 


anncxin 


Annexin 


7e~61 


215.7 


2 


44- 

111:116- 
183 


1219 


Armadillo 

_seg 


ArmadiUo/beta-catenin-like repeat 


5.3e-30 


113.1 


5 


351- 

393:405- 
447:448- 
498:499- 
541:543- 
585 


1219 


GA 


GA module 


8.3 


-5.7 


1 


180-224 


1221 


p450 


Cytochrome P450 


le-122 


421.1 


1 


30-483 


1222 


p450 


Cytochrome P450 


Ue-06 


-35.8 


1 


30-328 


1223 


A2M N 


AlDha-2-macroslobulin familv N- 
terminal regi 


4.3e-12 


-71.6 


1 


1-468 


1225 


Thymosin 

HJ III ViJl'i 


Thymosin beta-4 family 


2.3e~16 


67.8 


1 


2-41 


1227 


WD40 


WD domain, G-beta repeat 


1.6e-37 


138.1 


7 


115- 

151:165- 
201:207- 
244:250- 
286:293- 
328:334- 
370:391- 
431 


1236 


F-box 


F-box domain 


5e-07 


36.8 


1 


210-258 


1236 


UvrD- 
helicase 


UvrD/REP helicase 


0.00011 


-157.2 


1 


441-920 


1237 


LRR 


Leucine Rich Repeat 


23e-25 


97.7 


8 


42-65:66- 

88:89- 

111:112- 

134:135- 

157:158- 

180:181- 

203:204- 

227 


1238 


TPR 


TPR Domain 


9.6e-54 


192.0 


10 


22-55:56- 

86:87- 

120:121- 

154:155- 

188:189- 

222:223- 

255:290- 

323:328- 

361:362- 

395 


1241 


cadherin 


Cadherin domain 


0.00011 


29.0 


2 


48- 

151:165- 
254 


1243 


TTL 


Tubulin-tyrosine Hgase family 


3.6e-31 


117.0 


1 


1-225 


1245 


UCH-1 


Ubiquitin carboxyl-terminal hydrolases 
famil 


4.2e-08 


40,4 


1 
... 


190-221 
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1245 


zf-UBP 


Zn-finger in ubiquitin-hydrolases and 


L2e-05 


32.2 


1 


6|-148 






other 










1246 


ank 


Ankyrin repeat 


3.8e-92 


319.5 


U 


1%44:45- 














7W9- 
lly:112- 














141:145- 














177:179- 














211:212- 














244:245- 














277:278- 














310:312- 














344:345- 














374 


1250 


Clq 


Clq domain 


0.00033 


-3.8 


1 


827-946 


1250 


filament 


Intermediate filament protein 


1.3 


-187.6 


\ 


360-645 


1250 


spectrin 


Spectrin repeat 


1.9 


-12.5 


1 


492-591 


1250 


TNF 


TNF(Tumor Necrosis Factor) family 


4.1 


-20.5 




835-946 


1250 


Apolipopr 


Apolipoprotein A1/A4/E family 


5.7 


-113,3 


~ 


200-469 




otein 












1250 


sigma70 


Sigma-70 factor 


9 


-116.2 


j 


462-650 


12S2 


lamimn_ 


Lamintn N-terminal (Domain VI) 


3.2e-52 


186.9 


\ 


1-223 




Nterm 












1252 


laminin E 


Laminin EGF-Iike (Domains III and V) 


2e-38 


141.0 


3 


225- 




GF 








292:295- 














355:358- 














409 


1252 


NTR 


NTR/C345C module 


7.2e-30 


112.6 


I 


479-591 


1252 


Keratin B 
2 


Keratin, high sulfur B2 protein 
i 


6.7 


-81.2 


1 


318-451 


1253 


enolase 


Enol-ase 


0.038 


-162,0 




11-136 


1256 


HIT 


HIT family 


1.5e-55 


198.0 


1 


51-162 


1257 


ank 


Ankyrin repeat 


5.9c-24 


93.0 




39-67:68- 














100:101- 














133:134- 














164 


1258 


lectin c 


Lectin C-type domain 


I.3e-21 


85.2 


I 


51-181 


1258 


lectin c 


Lectin C-type domain 


L3e~21 


85.2 


1 


51-181 


1262 


mito_carr 


Mitochondrial carrier protein 


8.7e-67 


235.3 




7-100:102- 














221:224- 














319 


1263 


serpin 


Serpin (serine protease inhibitor) 


8.7e- 


374.8 


1 


87-463 








109 








1264 


MHCJ 


Class I Histocompatibility antigen, 


1.3e- 


483.9 


1 


25-203 






domains 


141 








1265 


MHCJ 


Class I Histocompatibility antigen, 


2.3e- 


446.5 


I 


25-203 






domains 


130 








1266 


MHCJ 


Class I Histocompatibility antigen. 


2.7e- 


403,1 


1 


22-187 






domains 


117 








1267 


MHCJ 


Class I Histocompatibility antigen, 


l.le- 


397.7 




25-196 






domains 


115 








1268 


MHCJ 


Class I Histocompatibility antigen. 


2.9e- 


406.3 




25-196 






domains 


118 






* 


1268 




Immunoglobulin domain 


9.5e-08 


39.2 




221-286 


1269 


MHCJ 


Class I Histocompatibility antigen, 


4,2c- 


452.3 




25-204 






domains 


132 








1271 


MHCJ 


Class I Histocompatibility antigen, 


le-144 


494,2 




25-203 






domains 










1272 


MHC I 


Class I Histocompatibility antigen, 


2.8e-95 


329.9 




25-204 
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domains 




* - 






1273 


MHCJ 


Class I Histocompatibility antigen, 


3.3e- 


f 479.2 


1 


25-203 






domains 


140 








1274 


Kelch 


Kelch motif 


1.46-91 


■ 17.7 


6 


271- 










I 




316:318- 










I 




366:368- 










D 




413:415- 














460:462- 














507:509- 














554 


1274 


BTB 


BTB/POZ domain 


3.Ie-38 


140.4 


1 


23-130 


1275 


IQ 


IQ calmodulin-binding motif 


0.0037 


23.9 


1 


394-414 


1276 


Glycos_tr 


Glycosy! transferase 


2.2e-22 


87.8 


1 


126-308 




ansf 2 












1276 


RicinJBJ 


QXW lectin repeat 


0.0045 


23.7 


2 


478- 




ectin 










518:520- 














557 


1280 


ig 


Immunoglobulin domain 


3.4e-05 


30.7 


2 


62- 














145:174- 














240 


1281 


UCH-1 


Ubiquitin carboxyl-terminal hydrolases 


1.5e-I2 


55.1 


1 


100-131 






famil 










1285 


UBX 


UBX domain 


2.9e-22 


87.4 


1 


205-284 


1286 


UBX 


UBX domain 


2.9e-22 


87.4 


1 


251-330 


1287 


DnaJ 


DnaJ domain 


9e-38 


138.9 


1 


8-70 


1288 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finder) 


7.9e-09 


42.8 


I 


9-57 


1288 


zf-B box 


B-box zinc finger 


1.8e-06 


34.9 


[ 


177-217 


1288 


SPRY 


SPRY domain 


0.0016 


14.8 


1 


479-605 


1288 


zf-UBRl 


Putative zinc finger in N-recognin 


2.2 


-18.4 


1 


180-237 


1289 


PKD 


PKD domain 


0 


1026.8 


16 


293- 














372:397- 














483:485- 














568:572- 














677:679- 














763:765- 














846 : 849_ 














931:933- 














1017:1019- 














1099:1 101- 














1183:1185- 














1269:1271- 








• 






1353:1355- 














1438:1440- 














1522:1524- 














1612:1614- 














1696 


1289 


REJ 


REJ domain 


2,3e- 


978.0 


I 


1723-2248 








290 








1289 


PLAT 


PLAT/LH2 domain 


2.6e-25 


97.5 




2673-2789 


1289 


GPS 


Latrophilin/CL-Mike GPS domain 


l.le-15 


65.5 




2566*2615 


1289 


Lectin c 


Lectin C-type domain 


0.59 


-23.7 




2-87 


1289 


DUF26 


Domain of unknown function DUF26 


7.8 


-16.5* 




2220-2262 


1290 


CNH 


CNH domain 


2.7c-24 


94.2 




69-375 


1292 


RhoGAP 


RhoGAP domain 


l.So-59 


211.0 




125^279 


1293 


Peptidase 


Matrixin 


8.2e- 


378.2 




48-211 




M10 




110 








1293 


fh2 


Fibronectin type II domain 


3,5e-84 


293.1 


3 


230- 














27i:288- 
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0 








329:347- 
388 


1293 


hemopexi 
n 


Hemopexin \ 

I 
I 


le-33 


125.5 


3 


486- 
530:608- 
654:656- 
699 


1294 


Peptidase 
M10 


Matrixin ^ 


8.2e- 
110 


378.2 


1 


48-211 


1294 


fh2 


Fibronectin type n domain 


3,5e-84 


293.1 


3 


230- 
271:288- 
329:347- 
388 


1297 


UQ„con 


Ubiquitin-conjugating enzyme 


3.3e-20 


80.6 


1 


400-557 


1297 


TT ORF2 


TT viral ORF2 


3.6 


-92.9 


1 


546-667 


1297 


UQ„con 


Ubiquitin-conjugating enzyme 


3,3e-20 


80.6 


1 


400-557 


1297 


TT ORF2 


TT viral ORF2 


3.6 


-92.9 


1 


546-667 


1298 




Immunoglobulin domain 


2.9e- 
236 . 


798.3 


21 


25-84:119- 

177:245- 

303:339- 

397:432- 

490:524- 

583:618- 

676:709- 

768:801- 

859:894- 

1048:1121- 

1275:1382- 
1440* 1477- 

1535*1569- 
1627:1664- 
1722:1756- 
\Si6:mu 
1911:1947- 
2005:2040- 
2098 


1298 


Adeno E 
3 CRi 


Adenovirus E3 region protein CRI 


0.062 


-3.7 


1 


1212-1288 


1299 


cNMP_bi 
nding 


Cyclic nucleotide-binding domain 


6.2e-28 


106.2 


1 


363-459 


1299 


ion trans 


Ion transport protein 


8.9e^21 


82.5 


1 


69-265 


1299 


ATP- 
synt DE 


ATP synthase, Delta/Epsilon chain, 
long 


6.8 


6,0 


1 


478-525 


1301 


RGS 


Regulator of G protein signaling 
domain 


L6e-49 


177.9 


1 


56-172 


1302 


THF DH 

G CYH 
C 


Tetrahydrofolate 
dehydrogenase/cyclohyd 


lc-99 


344.7 


1 


60-235 


1303 


gln-synt 


Glutamine synthetase 


1.3e- 
179 


610.1 


1 


1-321 


1304 


gln-synt 


Glutamine synthetase * 


0.13 


-214.4 


1 


1-95 


1305 


SCAN 


SCAN domain 


7.5e*61 


215.6 


1 


42-137 


1306 


Methyltra 
nsf 5 


MraW methylase family 


4.6e-06 


-96.4 


1 


70-205 


1307 


ion trans 


Ion transport protein 


8.3e-18 


72.6 


1 


438-672 


1307 


ank 


Ankyrin repeat 


1.9e-06 


34.8 


2 


77- 

108:163- 
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195 


1307 


Srg • 


C.elegans Srg family integral 
membrane prot 


8.1 


-222.4 


1 


418-669 


1310 


Ammonfe 

m_transw 


Ammonium Transporter Family 


L9c-56 


200.9 


1 


25-429 


1310 


FecCD I 


FecCD transport family 


0.89 


-200,6 


1 


97-331 


1311 


Kelch 


Kelch motif 


2.6e-60 


213.8 


6 


311- 

359:361- 

411:413- 

458:460- 

505:507- 

556:559- 

606 


1311 


BTB 


BTB/POZ domain 


1.6e-28 


108.1 


1 


23-181 


1313 


zf-B_box 


B-box zinc Finger 


9,6e-30 


112.2 


2 


92- 

133:418- 
459 


1313 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


1.3c-23 


9L8 


2 


15-59:341- 
385 


1313 


SPRY 


SPRY domain 


3.2e-23 


90.6 




672-813 


1313 


PHD 


PHD-finger 


0.97 


-9.8 




14-62 


1313 


NB-ARC 


NB-ARC domain 


9.5 


-15L3 




48-311 


1313 


zf-UBRl 


Putative zinc finger in N-recognin 


9,8 


-24.7 




421-470 


1314 


SRCR 


Scavenger receptor cysteine-rich 
domain 


l,7e-25 


98.1 




37-133 


1315 


adh short 


short chain dehydrogenase 


5.8e-33 


122.9 


J 


38-293 


1317 


ANP 


Atrial natriuretic nentide 


1.2e-5I 


185.0 




41-150 


1318 


EGF 


EGF-1 ike domain 


3.6e-26 


100.4 


6 


512- 

547:553- 

591:597- 

634:640- 

675:747- 

781:787- 

827 


1318 


ig 


Immunoglobulin domain 


2.1e-06 


34.7 


-j 


170-227 


1318 


TIL 


Trypsin Inhibitor like cysteine rich dom 


1.9 


-6.8 




741-787 


1318 


ldl^recept 
a 


Low-density lipoprotein receptor 
domain 


6.6 


-8.8 


-j — — 


551-593 


1318 


TILa 


TILa domain 


6.9 


-8.5 


i 


733-792 


1321 


FKBP 


FKBP-type peptidyl-prolyl cis-trans 
isomeras 


5.7e-45 


162.8 


i 


234-336 


1322 


PX 


PX domain 


l.le-25 


98.8 




273-382 


1322 


SH3 


SH3 domain 


6.5e-12 


53.0 


_j 


3-59 


1323 


ras 


Ras family 


5.2e-16 


45.4 


J 


35-218 


1323 


OTP EFT 
U 


Elongation factor Tu GTP binding 
domain 


0.53 


-76.1 




31-223 


1323 


MobB 


Motybdopterin guanine dinucleotide 
synthesis 


4,8 


-45,8 


-j 


36-150 


1323 


GTP CD 

C 


Cell division protein 


5.7 


-217.6 




36-247 


1324 


pkinase 


Protein kinase domain 


4.6e-51 


183.1 




199-527 


1325 


pkinase 


Pfotein kinase domain 


4.6e-51 


183.1 




199-527 


1327 


Peptidase 
CI 


Papain family cysteine protease 


73e- 
110 


378.4 




73-349 


1328 


Peptidase 
CI 


Papain family cysteine protease 


7-3e- 
110 


378.4 


1 


114-390 


1330 




Immunoglobulin domain 


4.6e-07 


36.9 


2 


52- 

108:145- 
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201 


1332 


Sec7 


Sec7 domain 


1.3e-71 


251.3 


1 


345-536 


1332 


PH 


PH domain 


0.25 


16.2 


1 


567-676 


1332 


IQ 


IQ calmoduiiii-binding motif 


0.35 


17.4 


1 


13-33 


1333 


cadherin 


Cadherin domain 


I.6e-94 


327.4 


5 


68- 

159:173- 
268:282- 
386:399- 
490:503- 
600 


1333 


Cadherin_ 
C term 


Cadherin cytoplasmic region 


t.6e-76 


267.6 


1 


648-819 


1335 


lipocalin 


Lipocalin / cytosolic fatty-acid binding 
pr 


7.5e-35 


129.2 


1 


5-133 


1336 


SH3 


SH3 domain 


2.4e-10 


47.8 


1 


495-549 


1336 


PID 


Phosphotyrosine interaction domain 
(PTB/PID) 


0.29 


-13.3 


1 


52-181 


1336 


SAM PN 
T 


Sterile alpha motif (SAM)/Pointed 
domain 


6 


-25.0 


1 


608-687 


1336 


SH2 


SH2 domain 


8.6 


-25.5 


1 


268-347 


1338 


UCH-2 


Ubiquitin carboxyl-termlnal hydrolase 


L5C-20 


81.7 


1 


350-411 


1338 


UCH-1 


Ubiquitin carboxyl-terminal hydrolases 


l.le-12 


55.5 


1 


111-142 


1338 


TFIIA 


Transcription factor 1IA, alpha/beta s 


1,7 


-15L4 


1 


386-698 


1338 


Tvnto 45 
kd 70kd 


Tymovirus 45/70 Kd protein 


3.3 


-291.1 


1 


310-647 


1338 


MARCKS 


MARCKS family 


6.8 


-95.6 


1 


573-865 


1340 


RCC1 


Regulator of chromosome condensation 
(RCC1) 


5.6e- 10 


46.6 


6 


77- 

136:140- 
192:195- 
245:248- 
298:301- 
356:359- 
406 


1342 


Peptidase 
Ml 


Peptidase family Ml 


4.6 


-194.5 


i 


159-470 


1342 


ubiquitin 


Ubiquitin family 


5.4 


-4.4 


1 


538-616 


1345 


crystall 


Beta/Gamma crystallin 


3.3e~38 


140.4 




1-53:61- 
144 


1345 


WD40 


WD domain, G-beta repeat 


6.7e^09 


43.0 


1 


269-305 


1348 


AAA 


ATPase family associated wjjh various 
cellul 


0.97 


-33.9 


1 


131-307 


1349 


AAA 


ATPase family associated with various 
cellul 


0.97 


-33.9 


1 


131-307 


1351 


PHD 


PHD-finger 


6.3e-15 


63.0 




190-237 


1351 


PHD 


PHD-finger 


6.3e-15 


63.0 


~ 


190-237 


1353 


CUB 


CUB domain 


X3e-13 


57.3 




416-524 


1353 


sushi 


Sushi domain (SCR repeat) 


1.5e-05 


31,9 


T— 


357-412 


1355 


SPRY 


SPRY domain 


3,4e-19 


77.2 




396-519 


1355 


fh3 


Fibronectin type III domain 


le-08 


42.4 




259-345 


1355 


zf-B box 


B-box zinc finger 


8.9e-07 


35.9 




44-86 


1356 


Patatin 


Patatin-Iike phospholipase 


0.59 


-42.1 




10-118 


1358 


ELM2 


ELM2 domain 


3.2e-21 


84.0 




195-256 


1358 


mybJ>N 
A-binding 


Myb-like DNA-binding domain 


Ue-09 


45.6 




299-345 


1359 


Reprolysi 
n 


Reprolysin (M12B) family zinc metallo 


K4e«12 


55.3 




239-457 


1359 


Pep_Ml2 
B_propep 


Reprolysin family propeptide 


3.6e~12 


53.9 




90-216 
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11/ 


1T1.UU.C1 


jl/csli ipiiuii 


17 iraliia 

i!*-v«iue 


Score 


Kcpeais 


ruMiion 




ten 1 


ThrOmbospondin type 1 domoin 






















rni '09Q ^ 














o&f.otSO- ■ 




























I AA9- inn© 














1 A^7 P 








tz/ij mouuic 


si a 


1 1 7 


i 
1 


40Z-D j 1 


1 JO J 


ank 


AnKyrm lepeat 


D,ze-zt> 




j 


II- 














104.1UJ- 














1 J / .1 JO- 














i 7A. 1 QA 














7I7'299- 














9^^ 
Zjj 




CO 


\ja. uufntim 


lt>/ D 




o 

£. 


1 £1 
101- 














247:293- 














JOl 


H£7 


w 


v^>^ uujiiaui 


It" / J 




9 


lOl- 




























JO Jl 


1368 


p450 


Pvtochrome P4*ift 


JC Jl X\f 


378.9 







1370 


an nertriG 


Amino acid permease 


1.3e-08 


-108.1 




49-452 




ases 












1370 


Neur cha. 


TJfturftlt^WTitTiittpr-Pfttf^d inn-phnnnpl tr 

l^luUI gulV^U IU11 LrlltlllllL^f \\ 


0,37 


-96.5 


— 


JJVJOJ 




n memb 












1370 


ion tran*i 


Ion transport ni'titein 

lil UlJi?^JWl L J/1 IF I Will 


I A 


-2 3 




78.R-61 S 


1370 




JT vl IllCttOV k\JL \ij LUdlllw UUl UIC&j Ulu^I] 


I S 

1 . j 


-17S ^ 
I / o.o 


-j 






t niir 












1370 


An iran*! 


TYfinQni^rrihrfinp am inn nr.iH trnnenrtrt/^r* 

1 I 11 Willi L/iClUb ullllllU UUIU U ullA|JUlld 


I 9 

I *y 


-100 ^ 


-j 


67-400 


1370 


DUF140 


Domain nFiitiknAwn fnnptiftn Tl\ IPIdfl 


2.6 


-156 6 




10Q-T17 


1370 


Niicleosid 


Nucleoside transporter 


3,2 


-154.9 


— 






e tran 












1370 






4.1 




-j 














— — — 




lJ / V 




llllvglcll lllClUUIullC piVJLClll JJUI D 


7 1 






jjO-O/ 1 


1 ^TA 


VTA HMHli 


NADH dehydrogenase 


7 1 


-zi j.y 


-j 


zlZ-olo 


1 J /U 


olNr 


Sodium:neurotitinsmitter symporter 


0 9 




— . 


1 1 /'-45U 
















1372 


kinesin 


T£inft*ltrt tnntnf drtmnin 

JLVJIlV/Olll IllUlA/1 UUlIICllll 


2.1e- 


463.2 


™ 










135 








1372 


Translin 


Translin Family 


10 


-82.4 


1 


315-462 


1373 


LRR 


Leucine Rich Repeat 


1 .9e-27 


104.6 


7 


60-83*84- 












107:108- 














131:132- 














155:157- 














180:181- 














204:205- 














225 


1373 




Immunoglobulin domain 


1.2e-05 


32.1 




310-368 


1373 


LRRCT 


Leucine rich repeat C-terminal domain 


4.7e-05 


30.2 




249-294 


1373 


fh3 


Fibronectin type Hf domain 


7.Ie-05 


29.6 




425-505 


1373- 


LRRNT 


Leucine rich repeat N-terminal domain 


0.83 


13.7 




27-58 


1374 


kinesin 


Kinesin motor domain 


9.5e-14 


1.2 




1-131 


1375 


zf-DHHC 


DHHC zinc finger domain 


9Je-34 


125.6 




101-165 


1376 


EOF 


EGF-like domain 


3.2e^45 


163.6 


9 


49.84:90. 














126:132- 














167:177- 














213:217- 














252:286- 
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1 




321:327- 














362:368- 














401:407- 














442 


1376 


CUB 


CUB domain 


9.6e-18 


72.4 1/ 1 


809-918 


1376 


TIL 


Trypsin Inhibitor like cysteine rich 


0.73 






84-132 






domat 






) 




1376 


Keratin B 
2 


Keratin, high sulfur B2 protein 


0.9 


-67 6 





111-242 


1376 


granulin 


Granulin 


LI 




— „ — 


285-323 


1376 


metalthio 


Metallothionein 


5.9 




— 


363-431 


1376 


DUF141 


Domain of unknown function DUF141 


6.7 




-j 


799-922 


1380 


ion trans 


Ion transport protein 


0.066 


16 ft 


— 

J 


J J J" J 1 o 


1380 


ABC2jn 


ABC-2 type transporter 


6.1 






145-334 




embrane 












1380 


oxidored_ 


NADH-Ubiquinone/plastoquinone 


6.2 


-167 S 
-no t „j 


- 


46-317 




Ql 












1380 


OATP C 


Organic Anion Transporter Polypeptide 


9.5 







75-3 16 


1381 


pkinasc 


Protein kinase domain 


9.7e-80 


17ft. 1 


— 


205-486 


1384 


ig 


Immunoglobulin domain 


0.00034 


27*4 




J 


65-342 


1388 


LRR 


Leucine Rich Repeat 


4.9e-16 


Oo./ 




20-43:44- 














67:68- 














91:92-115 


1388 


LRRCT 


Leucine rich repeat C~terminal domain 


7.4e-09 


4Z.9 


J 


125-175 


1388 


GPS 


Latrophilin/CL-Mike GPS domain 


0.0041 


20.8 




641-693 


1388 


HRM 


Hormone receptor domain 


0.0076 


16.3 


,_ l 




1388 


7tm_2 


7 transmembrane receptor (Secretin 


0.01 


-96,0 




704-9 R 1 
















1388 


if* 


Tminiinrtfflobulin domain 




10.9 


l - 


iyo-zoj 


1389 


MACPF 


MAC/Perforln Hntnfiln 

ivinV' 1 lihuiui VIlMflCllll 


0.016 


-71.3 


1 


m^i 1 1 


1391 


HD 


domain 

lily UUiUCXill 


ft V-ft7 


36.1 


1 


■V) lOT 
j-Z-lZ / 


1392 




or iiaiiu 


1 (\s 

i ,»>e-UD 


31.8 


2 


1-25:3j*61 




MORN 


iVIWlvlN repeal 


i.ie-jz 


122.1 


7 


39-61:62- 














85:86- 














1 AO.I An 




























1 S4-1 














177*178- 














200 


1395 


MORN 


MORN repeat 


3.1e-31 


1 17 1 


O 


39-61:62- 














85:86- 














108:143- 














165:166- 














188:189- 














211 


1396 


EPHJbd 


Ephrin receptor ligand binding domain 


4.6e- 


462.1 


1 


31-204 








135 








1396 


pkinasc 


Protein kinase domain 


8e-74 


258.7 


1 


635-892 


1396 


fn3 


Fibronectin type III domain 


2.5e-31 


117.5 


2 


329- 














425:437- 














524 


1396 


SAM 


SAM domain (Sterile alpha motif) 


2.5e-21 


84.3 


T~~ 


928-992 


1396 


PHD 


PHD-finger 


3.7 


-15.1 


l 


259-296 


1397 


Exo_endo 


Endonuclease/Exonuclease/phosphatase 


0.34 


-11.2 


i 


357-643 




_phos 


fa 










1397 


sugar 


Sugar (and other) transporter 


8.8 


496.7 


i 


7-343 


1397 


Exo_endo 


Endonuclease/Exonuclease/phosphatase 


0.34 


-11.2 


l 


357-643 




_phos 


fa 
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1397 


sugar Jr 


Sugar (and other) transporter h 


8.8 


-196.7 


I 


7-343 


1401 


homeobox 


Homeobox domain ™ 


0.097 


-3.3 


1 


78-126 


1403 


Tropomyo 
sin 


Tropomyosin } 


2.4 


-110,1 


1 


341-547 


1403 


filament 


Intermediate filament protein V 


2.6 


-195.5 


I 


412-760 


1403 


UvrD- 
helicase 


UvrD/REP helicase | 


8 


-249,4 


1 


153-720 


1403 


Myosin t 
ail 


Myosin tail 


8.9 


-551.2 


1 


20-777 


1404 


Clq 


Clq domain 


0.00033 


-3,8 


I 


827-946 


1404 


filament 


Intermediate filament protein 


1.3 


-187.6 


1 


360-645 


1404 


spectrin 


Spectrin repeat 


1,9 


-12.5 


1 


492-591 


1404 


THF 


TNF(Tumor Necrosis Factor) family 


4.1 


-20.5 


1 


835-946 


1404 


Apolipopr 
otein 


Apolipoprotein A1/A4/E family 


5.7 


-1 13.3 


\ 


9ftft-4fiQ 


1404 


sigma70 


Sigma-70 factor 


9 


-116.2 


I 


462-650 


1405 


Cache 


Cache domain 


6.5e-12 


53.0 


2 


402- 

dRT -791 

813 


1406 


ASC 


Amiloride-sensitive sodium channel 


2e-125 


430.1 


1 


159-579 


1407 


nlf iniiw* 


PfAtfMtl IfltiftQi* Hfimnin 
x IvlCIII (vllltlov UULUalll 


ft 00 


-i l j.j 


i 
i 


*C 91*7 


1408 


PBP 


Phn^nhatiHvlpthfinrilnmirti^-iSJrtf^tritT 

protein 


1 .OC" / J 




I 
l 


1-10/ 


1410 


abhydrola 

ss 


alpha/beta hydrolase fold 


1.4 


-12.1 


1 


75-318 


1412 


tTflfl 


rppnonitifMi mr*tif 


£. . / C" J Z, 




j 


9^0 

329:360- 
433:477- 
550 


1414 


DEP 


Domain found in Dishevelled, Egl-I0 t 
and Pie 


3.5e-32 


120.3 


2 


173- 

247:275- 


1414 


PH 


PH domain 


tp-09 


4*5 7 

4J. / 






1414 


PDZ 


PD7, domain ( Alsn Ittinu/n ac FIT-TR nr 

GLGF) 




1ft 7 

to. / 


0 


j 1 J- 

456:460- 
531 


1418 


SCAN 


SCAN domain 


2.6eol 


183.9 


1 


36-131 


1418 


zf-C2H2 


Zinc finger* C2H2 type 


4.6e-25 


96.7 


5 


406- 

428:435- 

457:463- 

485:522- 

545:553- 

575 


1418 


KRAB 


KRAB box 


0.023 


14.2 


i 


220-260 


1418 


zf-BED 


BED zinc finger 


10 


-7.4 


i 




1419 


EGF 


EGF-like domain 

* 


5.1e-28 


106.5 


9 


13-40:45- 

72:7% 

104:109- 

136:141- 

168:173- 

200:205- 

232:237- 

264:269- 

296 


1419 


Keratin B 
2 


Keratin, high sulfur B2 protein 


1.2 


-69.3 


1 


100-251 


1419 


Gamma- 
thionin 


Gamma-thionins family 


4.8 


-8.7 


1 


124-163 
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Mrtflpl 

XT JL\J U R 




jCi-Viiiue 


Score 


Repeats 


Position 


1421 


Ferric red 
uct 


Fcrric^reductasc like transmembrane 
com 


? ftp-fid 


777 ft 


i 
i 




1421 


MAD bin 
ding 


OviflrtrpHiirfncp TsIAD-hinHtno Hnmnin 

O^iduc^eNADbindrngdomam 


A 


Id 1 


i 
i 


tot cc\ 


1421 


FAD bin 

ding_6 


Oxidoftductase FAD-binding domain 


5 


"28.7 


t 
i 


At J- J J) J 


1421 


ruhredtyjci 
n 


Ruhredoxin 


fi 0 


1 1 ft 


i 
i 




1422 


zf-C2H2 


Zinc finger, C2H2 type 


5e-12 


53.4 


2 


1057- 

lU/y:10o5- 
1109 


149? 






K 
J 


"O.O 


i 


1056-1 101 


1422 


kd 70kd 


TH/irmvipiw A*sJ7(YfC H nrntpirt 


£ 
O 




i 


343-700 


1422 


zf-BED 


BED zinc finger 


7.1 


-6.0 


I 


1070-1110 


1423 


7tm 5 


7TM chemoreceptor 


3.3 


-162.5 


I 


181-451 


1423 


Cytidylylt 
ranss 


Phosphatidate cytidyl yltransferase 


5 


-87.9 


1 


21-135 


1424 


CAP OL 
Y 


CAP-Gly domain 

. — . , — — n ^^, — -- 


7.2e-46 


165.8 


2 


196- 

238:398- 
440 


1 A *>A 


anK 


Ankynn repeat 

— ■ ■ 




9.2e-09 


42.5 


3 


1-40:42- 
76:79-1 1 1 




t* AD') 


PAP2 superfamily 


1 £ ~ AO 

1.5e-08 


41.5 


I 


166-313 


1426 


SCAN 


SCAN domain 


6.2e-70 


245.7 


1 


33-128 






jCinc ringer;, CzHz type 


1.5e-44 


161.4 


6 


239- 

261:267- 

289:295- 

317:323- 

345:351- 

373:379- 

401 


1426 








j.i 


I 


zoO- 3 1 8 


1426 


DC1 


TTir^I H rutin iii 


J. A 




t 




1426 




z^iiiiy iin^cij type ^IWU Uuluctinu^ 


0 7 


-jj. i 


i 


323-364 


1427 


van nr np 

riti ease 




1 1 


-100 s 


i 
i 


1U4-4D3 


1428 


LRR 


Leucine Rich Repeat 


2*3e-16 


67.8 


4 


80- 

127:128- 
151:152- 
175 


1428 


LRRCT 


Leucine rich repeat C-terminal domain 

C 


0.00079 


26.2 





185-234 


1431 


PH 


PH domain 


7.6e-15 


62.8 




19-117 


1432 


PH 


PH domain 


6.4e-21 


82.9 


■ 


55-153 


1434 


filament 


Intermediate filament protein 


2,9 


-196.8 




128-488 


1434 


K-box 


K-box region 


4.1 


-38.4 




277-357 


1434 


OspD 


Borrelia outer surface protein D 


13 


-69.3 




151-409 


1434 


Apolipopr 
otein 


Apolipoprotein A1/A4/E family 


5.7 


-113.3 




56-318 


1437 


RhoGAP 


RhoGAP domain 


5.7e~57 


202.7 




1152-1305 


1437 


PH 


PH domain 


3.5e-18 


73.8 




922-1030 


1437 


PDZ 


PDZ domain (Also known as DHR or 
GLGF) 


0.0013 


25.4 




28-128 


1441 


annexin 


Annexin 


4.1e- 
109 


375.9 


4 


18^79:80^ 
135:151- 
219:227- 
294 
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SEQ ID 


Model 


Description 


E-Valuc 


Score 


Repeats 


Position 


L443 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


0.0075 


18.7 




1256-1307 


»43 


PHD 


PHD-finger 


0.18 


-2.9 


1 


1255-1310 


1443 


WD40 


WD domain, G-beta repeat 


2,9 


13.4 


1 


189-225 


l|i3 


K tetra 


K+ channel tetramerisation domain 


3.5 


-39.5 




830=916 


lfltf 


Clathrin 


Region in Clathrin and VPS 


8.9 


-25-2 


1 


976-1129 


1&4 


Rita 


Regulatory subunit of type 11 PKA R-s 


0.31 


10.8 


1 


17-54 


1*44 


AJpha_ad 
aptin C 


Alpha adaptin AP2, C-terminal domain 


7 


-63.6 


1 


91-155 


1445 


TK 


Thymidine kinase 


3.4e-98 


339.6 


\ 


61-231 


1446 


GDPD 


Glycerophosphoryl diester phosphodies 


5,4e-08 


40.0 




154-403 


1446 


Glycosjr 
angf_4 


Glycosyl transferase 


4.9 


-87.4 


J 

~ — - — 


22-145 


1447 


adh_short 


short chain dehydrogenase 


0.98 


-91.8 




2-196 


1448 


K-box 


PC-box region 


0.64 


-28.5 


i 


61-126 


1449 


PHD 


PHD-finger 


0.01 


8.4 


i 


1-42 


1452 


rnn 


RNA recognition motif. 


4.5e-19 


76.8 




77-148 


1454 


rvt 


Reverse transcriptase 


3.6e-34 


126.9 


J 


385-570 


1454 


GagJMA 


Matrix protein (MA), pi 5 


0.0018 


-21.0 


1 


10-131 


1454 


Gag^p30 


Gag P30 core shell protein 


0.54 


-80.3 




211-390 


1458 


COX5A 


Cytochrome c oxidase subunit Va 


1.2e-55 


198.3 


1 


42-131 


1459 


Guaiiylate 
kin 


Guanylate kinase 


6.2e-38 


139.4 




515-624 


1459 


PDZ 


PDZ domain (Also known as DHR or 
GLGF) 


6.8e-ll 


49.6 


- . 


256-335 


1459 


SH3 


SH3 domain 


0.027 


5.9 




348-415 


1459 


L27 


L27 domain 


0.049 


20.1 




186-238 


1459 


Caulimo_ 
mov 


Caulimovirus movement protein 


7.1 


-185.3 


1 


420-673 


1459 


A_deamin 
ase 


Adenosine/AMP deaminase 


7.8 


-138.5 


1 


64-421 


1461 


hexokinas 
e 


Hexokinase 


4.3e- 
284 


957.2 


1 


53-499 


1463 


Occludin 


Occludin/ELL family 


6.3 


-249.1 


J_ 


33-394 


1464 


trypsin 


Trypsin 


4.7e-72 


252.8 




30-232 


1466 


DDHD 


DDHD domain 


8.6e- 
117 


401.4 




613-860 


1466 


DUF203 


Domain of unknown function 


8.7 


-69.8 




254-460 


1467 


Glycos__tr 
ansf 1 


Glycosyl transferases group 1 


1.8e-27 


104.7 


1 


286-470 


1468 


EMP24 
GP25L 


emp24/gp25L/p24 family 


3.5e-70 


246.6 


1 


5-183 


1469 


EMP24 
GP25L 


emp24/gp25L/p24 family 


3.5e-8l 


283.1 




5-208 


1470 


14-3-3 


14-3-3 protein 


2.2e* 

142 


486.5 


■ — 


5^249 


1471 


filament 


Intermediate filament protein 


0.53 


-177.6 




2-249 


1471 


spectrin 


Spectrin repeat 


7.1 


-19.1 


j 


34-130 


1472 


MtN3 slv 


MtN3/saIiva family 


5.4 


-31.6 




35-139 


1472 


ATP- 
synt A 


ATP synthase A chain 


7.4 


-80.2 




91-214 


1474 » 


zf-CCCH 


Zinc finger C-x8-C-x5-C-x3-H type 


4,3e-07 


37.0 


2 


531- 

557:663- 

688 


1474 


zf-C2H2 


Zinc finger, C2H2 type 


6.3 


11.3 


2 


205- 

229:618- 

642 


1475 


SpoU_me 
thylase 


SpoU rRNA Methylase family 


2.1C-27 


104.5 


1 


145-301 
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SEQID 


Model 


Description 


E-valuc 


Score 


Repeats 


Position 


1476 


filament 


Intermediate filament protein 


2A 


-194.5 




427-720 


1476 


Tropomyo 
sin 


Tropomyosin 


3 


-111.7 




539-761 


1476 


spectrin 


Spectrin repeat 


3.3 


-15.3 




427-526 


1476 


K-box 


K-box region 


7.1 


^41.3 


; 


248-335 


1476 


BorreIia_ 
orfA 


Borrelta ORF-A 


9.8 


-102.9 




440-736 


1477 


MMR HS 
Rl 


GTPase of unknown function 


1.5c-90 


314.2 


! 


178-521 


1477 


DUF258 


Protein of unknown function 3 DUF258 


9-6 


-84.6 


1 


343-465 


1479 


RNase P 
H 


3' exoribonuclease family 


1.2e-96 


334.5 




48- 

251:358- 
581 


1479 


SI 


SI RNA binding domain 


0.057 


19.9 




675-750 


1479 


KH- 
domain 


KH domain 


0.35 


9.3 


j 


609-651 


1482 


COLFI 


Fibrillar collagen C-terrainal domain 


5.8e-29 


109.6 




97-277 


1482 


Collagen 


Collagen triple helix repeat (20 copies) 


8.5e-05 


29.4 


I 


2^61 


1483 


COLFI 


Fibrillar collagen C-terminal domain 


1.6e-35 


131.4 


1 


110-293 


1483 


Collagen 


Collagen triple helix repeat (20 copies) 


8.5e-05 


29.4 




2-61 


1484 


CH 


Calponin homology (CH) domain 


5.6e- 14 


59,9 


1 


4-304 


1485 


MoaE 


MoaE orate in 


7 


-55.7 




21-96 


1486 


zf-C2H2 


Zinc finger, C2H2 type 


6.8 


10.9 




871- 

896:904- 

929 


1487 


zf-C3HC4 


Zinc fmger, C3HC4 type (RING finger) 


2.6 


-3.3 


j 


336-377 


1489 


FH2 


Formin Homology 2 Domain 


0,00017 


-49.7 


1 


3-329 


1490 


AAA 


ATPase family associated with various 


4e-45 


163.3 




370-565 


1490 


SKI 


Shikimate kinase 


0.068 


^52.5 


1 


369-506 


1490 


Viral hell 
easel 


Viral fSunerfamilv 1^ RNA helicase 


1.8 


-67.0 




371-563 


1490 


LON 


ATP-denfindent m*otease La fLON^ 
domai 


3.6 


-69,4 




12-220 


1491 


Tropomod 
ultn 


Tropomodulin 


4.1e-78 


272.9 




34-402 


1491 


WH2 


WH2 motif 


0.83 


16.1 




534-553 


1491 


pkinase 


Protein kinase domain 


5.9 


-136.0 


rj 


334-538 


1494 


xan_ur_pe 
rmease 


Permease family 


2.9 


-189.3 




72-377 


1494 


Na^sulph 
symp 


Sodium :sulfate symporter 
transmembran 


5.3 


-356.1 


-j 


212-541 


1494 


Glycosjr 
ansf 4 


Glycosyl transferase 


7.3 


-90-3 


1 


374-528 


1494 


STE3 


Pheromone A receptor 


7.5 


-203.9 


; 


314-603 


1494 


DUF221 


Domain of unknown function DUF221 


9.6 


-234.2 




196-576 


1494 


7tm 5 


7TM chemoreceptor 


9.7 


-171.0 


i 


122-365 


1494 


oxidored 


NADH-Ubiquinone/plastoquinone 


9.7 


-171.5 


i 


42-264 


1495 


lectin c 


Lectin C-type domain 


4.70-34 


126.6 




53-164 


1496 


Cytochro 
me B561 


Cytochrome b56l 


2„le- 
113 


390.2 




1-240 


1498 


Hydrolase 


haloacid dehalogenase-like hydrolase 


0.0045 


16.9 




31-443 


1498 


Cation_A 
TPase C 


Cation transporting ATPase, C-terminu 


0.26 


-25-8 




535-706 


1498 


oxidored 
q4 


NADH-ubiquinone/plastoquinone 
oxidore 


4,4 


-34.0 




631-705 


1499 


DEAD 


DEAD/DEAH box helicase 


3e~64 


226.9 




139-356 


1499 


helicase 


Helicase conserved C-terminal domain 


4.1e-32 


120.1 




447-518 



342 



WO 02/070539 



PCT/US02/05095 



SEQ ID 


Model 


Description 


K-value 


Score 




Position 




C 












1502 


CZ 


C2 domain 


1.3e-51 


184.9 


-2 >~ 


28- 

109:184- 
264 


1503 


ank 


Ankyrin repeat 


L6e-31 


118.2 


D 


743- 

775:776- 

807:808- 

840:842- 

874:875- 

915:916- 

948 


1504 


ank 


Ankyrin repeat 


t.4e- 
167 


570.1 


16 


57-89:90- 

122:123- 

155:210- 

242:243- 

275:276- 

JUii:joi- 

398:399- 

464:525- 

691*644- 
675:678- 
710:711- 
743:744- 
776 


1504 


SAM 


SAM domain (Sterile alpha motif) 

— „ .... -,. . i „ " £ 


1.3c-09 


45.4 


1 


872-934 


1504 


PARP 


Poly(ADP-ribose) polymerase catalytic 
domai 


0.022 


-59.4 


I 


954-1161 


1504 


3Beta HS 
D 


3-beta hydroxysteroid 
dehydrogenase/isomera 


1.3 


-204.0 


1 


208-536 


1505 


PHD 


PHD-ftnger 


0.26 


-4.5 


1 


132-191 


1505 


DC1 


DC1 domain 


2.8 


5.8 


1 


131-159 


1506 


PHD 


PHD-iinger 


0-26 


-4.5 


1 


156-215 


1506 


DC1 


PCI domain 


2.8 


5.8 


1 


155-183 


1507 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


3.6e-08 


40.6 


1 


224-261 


1507 


LON 


ATP-dependent protease La (LON) 
domain 


0.007 . 


-21.6 


1 


305-510 


1507 


TPR 


TPR Domain 


1.6 


13.6 


2 


41-74:75- 
108 


1508 




Immunoglobulin domain 


le-76 


268.3 


12 


78- 

111 '171- 

245:276- 

330:364- 

432:463- 

516:552- 

623:654- . 

705:740- 

797:828- 

880:914- 

9&L1012- 

1067:1101- 

1169 


1512 


FYVE 


FYVE zinc finger 


3.2e-14 


60.7 


1 


152-261 


1512 


HypA 


Hydrogenase expression/synthesis 
hypA 


0,81 


-51.2 


1 


97-194 


1512 


RNA PO 
L M 15K 


RNA polymerases M/15 Kd subunit 


2.1 


-1.7 


1 


160-212 
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SEO H> 


Model 


Description 


E-value 


Score 


Repeats 


Position 




D 












1516" 


Gelsolin 


Gelsolin repeat 


*0.76 


5.9 




1011-1052 


1518 


V1R 


Vomeronasal organ pheromone 


|2.3e-ll 


41.1 




73-338 






receptor family 










1518 


7tm_l 


7 transmembrane receptor (rhodopsin 


fi).044 


-69.7 


1 


62-325 






family) 


r 








1520 


SPRY 


SPRY domain 


S.2e~24 


93.2 




95-230 


1521 


Tropomod 


Tropomodulin 


0.065 


-128.4 


I 


214-495 




ulin 












1521 


LRR 


Leucine Rich Ren eat 


1,8 


15.0 


4 


348- 














371:376- 














403:404= 














427:432- 














456 


1522 


7f-CCHC 


7ine knuckle 


0 051 


15.9 


1 




IDA.3 




i3K.pi luifiuy 


fx Ip-IA 

U.jC"1u 




i 
i 




1 -TO A 


ivnOvjAr 


luioo/Vr uoinaLn 


A Oa If 


1 1 £ ft 


i 
i 


i^r IOC 


1 JZK 

1 jZj 


T T/"\ AAR 

uv_con 


Ubiquitin-conjugating enzyme 


I An- lO 




i 

i 








jueucine isicn Repeat 




I jIA 


o 

7 


KO- 




























IJJ . 1 J4 














157*1 5ft- 














181:182- 














7ft5-Tft£_ 














770-7^ft- 














951 *75*L 

/iJ 1 .*i J*r-' 














777'77Q. 














^07 

JUL 




LRRNT 


T^itfiinft rif.li rpnpnt lNJ-tfifminal Hfimfiin 


6 6p-06 


33.1 




TVfift 
JJ"UU 


1 S77 




f Piifinp Hpfi rpnpfit f^.fprtniiifif rlr*msnn 
i_.cuuiii^ uisii icpccH ^"i^iiUJiiai uuiiiaiu 




17 O 


I 






IV ICUU 


K -1- r*iicinnf»l tf»fi*iirrif*i*ictififitt Hnmuin 
Iv~ UllallllCl ICU tUllCl INaUUH UUITlcllll 


ft flftlfi 


-S ft 


1 
1 


117 'Jon 




MORN 




1 ,OC-Zit 


04 7 


7 
















lU/l.iUYZ- 














ino/i> 1 1 f»n_ 

I "J?*** 1 1UU- 














1 197-1 191- 

1 1 /tifc. 1 1 J" 














1143:1 151- 














1171:1198- 














1220:1221- 














1244 


1529 


VPS9 


Vacuolar sorting protein 9 (VPS9) 


8.3e-06 


32.7 


1 


1551-1656 






domain 










1529 


RCCl 


Regulator of chromosome condensation 


8,56-06 


32.7 


3 


168- 






(RCCl) 








216:527- 














574:579- 














625 


1529 


RhoGEF 


RhoGEF domain 


0.097 


-40.8 


1 


694-884 


1529 


PH 


PH domain 


0.23 


16.5 


1 


901-1005 


1530 


profilin 


Profilin 


1.4e-63 


224.6 


1 


3-135 


1531 


profilin 


Profilin 


7.4e-48 


172.4 


1 


3-119 


1532 


60s„ribos 


60s Acidic ribosomal protein 


3.3 


-34.2 


1 


39-153 




omal 












1533 




jmjC domain 


0.01* 


-0.1 


1 


1-50 


1533 


PHD 


PHD-finger 


0.042 


2.9 


2 


508- 














549:609- 














655 


1534 


kinesm 


Kinesin motor domain 


3.5e-64 


226.7 


I 


1-177 


1536 


aminotran 
3 


Aminotransferase class-in 


7.8e-42 


152.4 


1 


1-373 
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SEQ ID 


Model 


Description 


E-value 


Score 


Repeats 


Position 


1536 


LEA 


Late embryogeriesis abundant protein 


5.6 


-4.7 


1 


109-177 


1539 


Hydrolase 


haloacid dehalo^enase-like hydrolase 


0.0064 


15.3 


1 


264-685 


1539 


Cation_A 
TPase_C 


Cation transporting ATPase, C-terrninu 


4.6 


-46.1 


1 


784-916 


1541 


TPR 


TPR Domain 1 


0.00036 


27.3 


2 


135- 

168:204- 
237 


1542 


PCMT 


Pi-otein-L»isoa3partate(D-aspartate) 0- 
methyl 


t.2e-ll 


21.8 


1 


9-224 


1543 


Peptidase 
C54 


Peptidase family C54 


3.1e-58 


206.9 


1 


76-364 


1545 


homeobox 


Homeobox domain 


4.8e-26 


100.0 


1 


233-286 


1546 


zf~C2H2 


Zinc finger, C2H2 type 


3.1e-85 


296.6 


14 


14-36:42- 
64:70- 
92:99- 
122:128- 

221:227- 

277:283- 
305:311- 
333:339- 
361:367- 
389:395- 








1 0 


-LO 


i 
i 




I Jf O 




JDJCLy /Si fit linger 


7 T 
A.J 


-1 7 


i 

i 




1546 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


4.6 


«5.4 


I 


341-372 


1547 


Ribosoma 
I S5 


Ribosomal protein S5 


3.4e^09 


44.0 


1 


222-352 


1 KAfl 




iv^ cairnuuuiin-Dinaing mom 






c 
D 


768-771- 

/ UO. III- 

791 :794- 
814:935- 
955:958- 
978 


1552 


DUF6 


Integral membrane protein DUF6 


0.14 


6.6 


1 


150-279 


1552 


SBF 


Sodium Bile acid symporter family 


9.2 


-75.2 


1 


143-321 


1553 


zf-C2H2 


Zinc finger, C2H2 type 


2,Ie-05 


31.4 


3 


80- 

105:107- 
130:144- 
169 


1554 


F-box 


F^box domain 


7.7e-05 


29,5 


1 


4-52 


1555 


Ran BP1 


RanBPl domain 


Lle~88 


308,0 


1 


37-161 


1555 


WH1 


WH1 domain 


6.8 


-26.8 


1 


45-159 


1556 


actin 


Actin 


8.4c- 
151 


514.4 


1 


1-372 


1557 


OTP EFT 
U 


Elongation factor Tu GTP binding 
doma 


9.7 


-93.3 


1 


91-355 


1557 


Defensin_ 
propep 


Defensin propeptide 

* 


9.8 


-11.4 


1 


1-50 


1559 


GTP EFT 
U 


Elongation factor Tn GTP binding 
domain 


L5e-ll 


51.8 


1 


125-348 


1559 


GTP EFT 
U D3 


Elongation factor Tu C-terminal 
domain 


8.1e-07 


33.1 


1 


451-541 


1559 


GTP EFT 
U D2 


Elongation factor Tu domain 2 


le-06 


35.8 


1 


363-446 
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SEQ ID 


Model 


Description 


E-vnhie 


Score 


Repents 


Position 


1559 


ATP-bind 
1 


Conserved hypothetical ATP binding 
protei 


8.5 


-132.9 


1 


126-312 


1559 


dyfcamin 


Dynamin family 


8,7 


-85.0 




110-278 


1567 


calb_anhy 


Eukary otic-type carbonic anhydrase 


4,6e- 
170 


578.4 


1 


5-241 


1568 


ArJj)erox 
idase 


Animal haern peroxidase 


8.1e- 
164 


557.6 




144-683 


1568 


DUF37 


Domain of unknown function DUF37 


6.5 


-36.0 




462-518 


1569 


DAG 


FAD dependent oxidoreductase 


0.055 


-90.8 


1 


49-381 


1571 


CH 


Calponin homology (CH) domain 


l.4eT25 


98.4 




126-233 


1573 


NUDIX 


MutT-like domain 


5.5e-12 


53.3 


1 


96-221 


1574 


HECT 


HECT-domain (ubifjuitin- transferase) 


4.3e-16 


66.9 




281-573 


1575 


ig 


Immunoglobulin domain 

****** 4 Ml 3\/£yl\JU Ul 11 1 UVI 1 J Ul 1 1 


2.8 


11.5 




122-187 


1577 


7tm 1 


1 transmembrane receptor 


3.4 


-113.2 




42-246 


1577 


Bac_rhod 
opsin 


Bacteriorhodopsin 


9,6 


-139.9 




Hl-313 


1578 


fn3 


TMhrnrtprfrin fvnp TTT rlnmsiin 

k ll/lUHCuLlll lj|JC 111 UUlIlulll 


0 91 


1 1 7 


-: 


1 At 1 Z, I 1 


1579 


Acetyltran 
sf 


Acetyl transferase (GNAT) family 


2.4e-14 


61.1 


n 

V 


548-624 


1580 


Acetyltran 

sf 


Acetyltransferase (GNAT) family 


2.4e-14 


61.1 




653-729 


A JOV 


Acstvltfan 

r\ lrwl- J 1 II Mil 

sf 


rxvvvyi u cuioJLvl ujV ^vjin/a. l j laiiiiiy 


9 4p-1/t 


61 1 

Ul • 1 





651-790 


1582 


GtDA 


ftliiruvip tnhlhitpfl Hivi'sinn nrotain A 

UJllVA/tK* IIIHIL/IL&U UlVliJIUJl ^/IfJL^llL / v 


0.0017 


-414.5 




UO**l 7U 


1583 




RF hand 


1 9 


14 0 


—. 




1585 


Dyneinji 
eavy 


Dynein heavy chain 


3.6e-18 


-92.1 




1-363 


1587 


Sm 


Sm protein 


2e-07 


38.1 




43^124 


1588 


PDZ 


PDZ domain (Also known as DHR or 
GLGF) 


3.1e-15 


64J 


_j 


3-83 


1590 


MAPI L 

C3 " 


K^ippftfiihi ilp ficcrtplfifpH tt^ntpin 1 A/1R 

IVIIOI ULUUU1C CljaUljIcLlCLl UIUIC1I1 1 r\l I t>j 

lipht 




-35. 1 




CK) 1 «7 


1591 


Sviitflxin 

kJJ 11 l«-A Hi 


Svrttavin 


2.3e-09 


38.1 


-j 


1 *rUU 


1591 


synaptobr 
evin 


Synaptobrevin 


5.8 


-14.5 




184-272 


1591 


DUF148 


Domain of unknown function DUF148 


7.7 


-38 A 


• 


17-129 


1592 


Iflrrtinin T*. 

GF 


Laminin F.ftF-lilcfi ^Domains HI find 


12 


-4,2 




153-196 


1592 


EOF 


EGF-like domain 


2.4 


10.9 


3 


140- 
177:284- 
313:351- 
380 


1592 


metalthio 


Metal lothionein 


4.8 


-9.3 


1 


288-348 


1593 


DnaJ 


DnaJ domain 


3.4e-40 


146.9 


1 


3-69 


1594 


HMGJ50 

X 


HMG (high mobility group) box 


2.6e-27 


104.1 


1 


346-414 


1598 


HMG_bo 

X 


HMG (high mobility group) box 


3.4e-30 


113.8 


1 


45-113 


1600 


CUB 


CUB domain 


6,9 


-43.0 


1 


224^313 


1601 


DUF6 

* 


Integral membrane protein DUF6 


l.le-12 


55.6 


2 


113- 

238:266- 
390 


1601 


sugarjr 


Sugar (and other) transporter 


5.7 


-191.4 


1 


55-405 


1601 


oxidored 
Jj] „„ 


NADH-Ubiquinone/plastoquinone 
(complex I) 


6 


-167.3 


1 


131-389 


1602 


FF 


FF domain 


2.3e-33 


124.3 


5 


272- 

321:339- 

388:406- 
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E-valuc 


Score 


Repeats 


Position 














461:486- 
541:622- 
673 


1602 


WW 


WW domain 


4.3e-20 


80.2 


2 


88- 

117:129- 
158 


1603 


FF 


FF domain 


2.3e^33 


124.3 


5 


235- 

284:302- 

351:369- 

424:449- 

504:585- 

636 


1603 


WW 


WW domain 


l.le-09 


45.6 


1 


92-121 


1605 


AT hook 


AT hook motif 


1.9 


11.9 


1 


360-372 


1606 


aminotran 
5 


Aminotransferase class* V 


1.3e- 
127 


4373 


1 


37-377 


1607 


aminotran 
5 


Aminotransferase class-V 


3.5e-94 


326.3 


I 


37-331 


1611 


Granin 


Granin (chromogranin or secretogranin) 


6.6 


-185.2 


1 


125-609 


1612 


PHD 


PHD-fmger 


0.59 


-7.8 


I 


55U610 


1613 


Branch 


Core-2/I-Branching enzyme 


le-77 


27L6 


1 


46-313 


1614 


mbt 


mbt repeat 


3.2e- 
101 


349.7 


4 


78- 

153:192- 
265:304- 
381:412- 
486 


1614 


SAM PN 
T 


Sterile alpha motif (SAM)/Pomted 
domain 


0.0021 


6.9 


1 


809-888 


1614 


SAM 


SAM domain (Sterile alpha motif) 


0.023 


20.6 


1 


822-885 


1615 


UPF0I03 


Protein of unknown ftmctfon DUF52 


4.7e-64 


226.2 


1 


9-270 


1616 


C2 


C2 domain 


6.8e-36 


132.7 


2 


606- 

695:755- 

842 


1617 


C2 


C2 domain 


7,8e-35 


129.2 


2 


87- 

176:236- 
323 


1618 


C2 


C2 domain 


0.16 


2.4 


1 


265-346 


1619 


7tm_l 


7 transmembrane receptor (rhodopsin 
family) 


3.9e-20 


80.3 


1 


217-427 


1620 


K tetra 


K4- channel tetramerisation domain 


Ue-25 


98.7 


1 


3-101 


1620 


era 


BTB/POZ domain 


9 


-22,4 


1 


21-104 


1623 


cyclin 


Cyclin, N-terminal domain 


0.057 


-1.4 


1 


46-149 


1624 


zf-C2H2 


Zinc finger, C2H2 type 


8.9e~19 


75.8 


4 


34-57:71- 
93:112- 
134:143- 
165 


1627 


PDZ 


PDZ domain (Also known as DHR or 
GLGF) 


4.2e-16 


66.9 


1 


66-149 


1628 


MttA Hcf 
106 


mttA/Hcfl 06 family 


2.5 


-14.8 


1 


54-107 


1628 


Tropomyo 
sin 


Tropomyosin 


4 


-114.1 


1 


74-272 


1628 


Syntaxin 


Syntaxin 


5.7 


-103.5 


1 


82-402 


1628 


kinase 


Phosphatidyiinositol 3- and 4-kinase 


8.5 


-118.0 


1 


125-342 


1628 


HlyD 


HlyD family secretion protein 


9.4 


-64.2 


1 


129-400 


1628 


UPF0089 


Uncharacterised protein family 


10 


-132.7 


1 


150-338 


1629 


rrm 


RHA recognition motif. 


1.7e-47 


171.2 


2 


72- 
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Description 


E-value 


Score 


Repeats 


Position 














142:156- 














225 


1632 


Ri bosom a 


Ribosomal protein L 10 


4.2 


-31.6 


] 


70-J73 




1 L10 












1632 


Ribosoma 


Ribosomal protein L10 


4.2 


-31.6 


1 


70^73 




I L10 










1 


1635 


death 


Death domain 


1.5 


2.0 


1 


261^48 


1636 


death 


Death domain 


1.5 


2.0 


1 


73-160 


1637 


Gelsolin 


Gelsolin repeat 


4.1e-92 


319.4 


6 


27-76:148- 














188:265- 














307:398- 














451:523= 














564:626- 














668 


1639 


TBC 


TBC domain 


2e-08 


15.3 


1 


98-293 


1640 


TBC 


TBC domain 


2e-07 


13 


1 


98-297 


1641 


homeobox 


Homeobox domain 


0.0097 


7.0 


I 


83-135 


1646 


transmcm 


Tetraspanin family 


I.6e-75 


264.3 


1 


18-264 




brane4 












1652 


LacY^sy 


LacY proton/sugar symporter 


4.9 


^335.5 


1 


66-299 




mp 












1652 


oxidored 


NADH-Ubiquinone/plastoquinone 


7.6 


-1693 


I 


53-279 




ql 


(complex I) 










1653 


Colipase 

c 


Colipase, C-terminal domain 


4.3e-24 


93.5 


1 


21-65 


1654 


SH2 


SH2 domain 


0.0091 


3.5 


1 


1-83 


1658 


sun 


Translation Initiation factor SUII 


4.4e-46 


166.5 


1 


50-149 


1661 


jmjC 


jmjC domain 


0.00052 


13.9 


1 


308-415 


1663 


SH3 


SH3 domain 


0.011 


9.5 


1 


332-388 


1664 


UBA 


UBA/TS-N domain 


6,7e-06 


33.0 


1 


194-233 


1664 


TUDOR 


Tudor domain 


0.2 


-0*7 


1 


506-627 


1665 


Ribosoma 


Ribosomal protein S21 


0.0039 


1K7 


1 


10-62 




I S21 












1666 


transmem 


Tetraspanin family 


2.2e-71 


250.6 


1 


23-264 




brane4 












1669 


RuBisCO 


Ribulosc bisphosphate carboxylase, 


2.8 


-52.1 


1 


354-430 




small 


smal 










1671 


LRR 


Leucine Rich Repeat 


2e-50 


180.9 


12 


2Q-47-4R- 














71:72- 














95:96- 














118:119- 














142:143- 














166:167- 














189:190- 














213:214- 














235:236- 














259:260- 














283:284- 














307 


1671 


7tm 1 


7 transmembrane receptor 


0.0032 


-43.2 




439-688 


1671 


Cytidylylt 


Phosphatidate cytidylyltransferase 


7.1 


-89.9 




520-617 




rans 










* 


1671 


oxidored 


NADH-Ubiquinone/plastoquinone 


9.7 


-171.5 




475-685 




ql 












1671 


MerC 


MerC mercury resistance protein 


9.8 


*87.5 




534-632 


1672 


UCH-2 


UbiquhSn carboxyl -terminal hydrolase 


4e-28 


106.9 




292-364 






family 










1672 


UCH-1 


Ubiquitin carboxyMcrminal hydrolases 


1.9e-14 


61.4 


1 


35-66 
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E-valuc 


Score 


Repeats 


Position 






famil 










1673 


F-box 


F-box domain 


0.00071 




-j 


87-134 


1674 


Lamp 


Lysosome-associated membrane 
glycoprotein 


5.7 


*L9U 




351-653 


1675 


FGGY 


FGGY family of carbohydrate kinases, 
N-tcrmi 


2.7&-4S 






20-282 


1675 


FGGY_C 


FGGY family of carbohydrate kinases, 
C-termi 


5.5e-23 


8$.8 


- 


285-491 


1676 


KeratinJB 
2 


Keratin, high sulfur B2 protein 


7,4 


-8 1. 9 




24-201 


1678 


sjoo 


S-100/ICaBP type calcium binding 
domain 


8.3 


-9.6 




909-941 


1680 


pyr redox 


Pyridine nucieotide-disulphide 
oxidoreducta 


7.4e-24 


92.7 


1 


12-369 


1681 


WD40 


WD domain G-heta rpiiflnt 


9*3 e- 14 


59. 1 


V 


343:352- 
389:395- 
432:440- 
476:487- 
524:583- 
621 


1683 


Ribosoma 
1 L44 


Ribosomal protein L44 


le-38 


142.1 


I 


17-95 


1686 


WD40 


WD domain, G-beta repeat 


8.9e-l4 


59.2 


2 


8-43:50-86 


1688 


FHE 


Flagellar hook-basal body complex 
protein Fl 


5.1 


-27.5 


1 


673-763 


1690 


dCMP_cy 
t deam 


Cytidine and deoxycytidylate 
deaminase 


2e-13 


58.0 


I 


12-100 


1691 




ClOT Hnmfiin 


5.1 


-8.6 


] 


712^760 


1692 


cpn60 TC 
PI 


TCP-l/cpn60 chaperonin family 


0,012 


-260.5 


I 


32-187 


1693 


Glycopho 
rin A 


Glycophorin A 


4.4 


-43. 1 


I 


16-149 


1U7U 




T'inr finapr COW? tvnp 


A ^p- 1 ^ 




< 


115:120- 
143:174- 
198:210- 

233:329- 
353:363- 
386 


1698 


LRR 


Leucine Rich Repeat 


0.44 


17.0 


4 


37-58:59- 
80:81- 
102:103- 
125 


1699 


VHS 


VHS domain 


9.5e-60 


21 L9 


1 


5-146 


1700 


rrm 


RNA recognition motif. 


3.6e-23 


90.4 


3 


128- 

203:332- 
402:413- 
480 


1701 




Ankyrin repeat 


3.8e- 
101 


349.4 

• 


12 


12-44:45- 

77:79- 

111:112- 

144:145- 

177:179- 

211:212- 

244:245- 

277:278- 

310:312- 
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E-value 


Score 


Repeats 


Position 






• 








344:345- 
374:375- 
407 


1701 


URO-D 


Uroporphyrinogen delbrboxylase 
fURO-D^ I) 


7.8 


-229.7 


1 


79-354 


1702 




Prnti^in lrinn<;P dnmEurr 




£Q A 
"Oo.u 


i 
i 


^1 77S 
J l"Z/<> 


1702 


HEAT 


HEAT repeat 


1.2 


15.6 


2 


342- 
537 




cyclin 


fvf^lin iVT—'t'onTi i n q 1 /i/immn 
\j y\f lUlf IN -let 111 111 al UUuimi) 




JU. d 


i 
i 


1 j /-z/y 


17or 




W 17 UUmttlH, VJ-DCia lcpcdl 




*>I.o 


c 
J 


z/o~ 

Ji /I- 

407:413- 

447:493- 

529:535- 

569 


1709 


sm 


SH2 domain 


0.002 


10.0 


1 


287-364 


171 A 


abhydrola 
se 


alpha/beta hydrolase fold 


1 la ^A 




1 

1 


124-355 


I /w 


aonyaroia 
se 2 


Phospholipase/Carboxylesterase 


1 1 

l.z 


O/t CI 


1 

I 


179-359 


171 1 

wll 




zjnc linger, wraC4 type ^Kirso nngcr ) 


A AA7 


2.5 J 


1 

1 


OOA *1^A 

280-320 


1711 
1 1 1 1 


pun 


rtiLJ-iinger 


1 Q 

1.7 


1 1 /I 

-1^.4 


1 
1 


282-323 


17IO 


WL>4V 


WD domain, G-beta repeat 




7y| n 


> 


67- 

101:123- 
157:164- 

290 


1714 


Apvltrarmf 

J"Wr V 1 11 al lOA 

erase 


A f' vl trfi tl (iff* flS 


VM/v4 1 


14 0 


1 

1 




1719 


helicasc 
C 


Heli case conserved C-terminal domain 


L2e-19 


78.7 


t 


397-475 


1719 


DEAD 


DEAD/DEAH box he! tease 


4.6e-07 


10.4 


I 


?~?10 
1 1/ 


1719 


IPT 


IsonenteiwI transferase 


7.9 


-160.7 


1 


17-177 


1720 


Na_CaJ2 

X 


Sodium/calcium exchanger protein 


8e-76 


265.3 


2 


109- 

249:471- 
616 


1720 


DUF205 


Domain of unknown function DUF 


5.3 


-112.7 


1 


85-253 


1720 


Fumamte 
red D 


Fumarate reductase subunit D 


8.4 


-49.5 


1 


518-620 


1721 


vwa 


von Willebrand factor type A domain 


9.6e-29 


108.9 


1 


34-205 


1721 


fn3 


Fibronectin type IU domain 


3e-l5 


64 J 


2 


212- 

287:332- 

413 


1724 


pkinase 


Protein kinase domain 


1.6e-06 


-4L3 


1 


40-359 


1725 


BRCT 


BRCA1 C Terminus (BRCT) domain 


le-15 


65.6 


3 


3-93:642- 

730:753- 

833 


1727 


BRCT 


BRCA1 C Terminus (BRCT) domain 


7.2 


1.2 


1 


45-130 


1728 


rnn 


RNA recognition motif. ^ 


l.lc-05 


32.4 


2 


545- 

612:880- 
942 


1728 


zf-CCCH 


Zinc finger C-x8-C-x5-C-x3-H type 


0.012 


22.2 


1 


285-311 


1728 


PWI 


PWI domain 


0.047 


-0.6 


1 


6-78 


1730 


PX 


PX domain 


2.5c-31 


117.5 


1 


94-211 


1731 


PMP22_C 
laudin 


PMP-22/EMP/MP20/Claudin family 


1.5e-56 


201.3 


1 


1-157 
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E-vahie 


Score 


Repeats 


Position 


1733 


homeobox 


Homeobox domain 


0.00037 


21.6 


i 


268-339 


1733 


VacJRusr 
on i 


Chordopox virus fusion protein 


5.2 


-21.5 


t 


286-364 


1733 


-CUl 1 


CUT domain 


8.5 


-38.0 


1 


159-240 


1734 


homeoboxl 


Homeobox domain 


0,00037 


21.6 


1 


268-339 


1734 


Vac Fusi j 
on " 1 


Chordopoxvirus fusion protein 


5.2 


-21.5 


1 


286-364 


1734 


CUT 


CUT domain 


8.5 


-38.0 


1 


159-240 


1738 


efhand 


EF hand 


1.5e-16 


68.4 


2 


40-68:76- 
104 


1742 


trypsin 


Trypsin 


6.5 


-55.6 


1 


33-255 


1743 


Exonuclea 
se 


Exonuclease 


Ue-36 


135.2 


1 


39-213 


1745 


Smr 


Smr domain 


0.0029 


13.0 


1 


1594-1672 


1745 


RyR 


RyR domain 


1.5 


-29.0 


1 


1312-1403 


1745 


L27 


L27 domain 


3.6 


5.1 


1 


15734626 


1746 


Smr 


Smr domain 


0.0029 


13.0 


1 


161M689 


1746 


RyR 


RyR domain 


1.5 


-29.0 


3 


1312-1403 


1746 


L27 


L27 domain 


3.6 


5.1 


1 


1590-1643 


1746 


Smg4 UP 
F3 


Smg-4/UPF3 family 


9.7 


-74.5 


1 


1379-1518 


1747 


mito_carr 


Mitochondrial carrier protein 


1.3e-81 


284.5 


3 


1-90:98- 
198:200- 
288 


1748 


OSCP 


ATP synthase delta (OSCP) subunit 


8.4 


-94.4 


1 


67-213 


1749 


Oxysterol 
BP 


Oxysterol -binding protein 


3.4e*78 


273.2 




488-875 


1749 


PH 


PH domain 


2.2e~09 


44.6 





52-146 


1750 


lactamase 
_B 


Metallo-beta-Iactamasesuperfamily 


4.1 


-28-9 




1-153 


1751 


lipoxygon 
ase 


Lipoxygenase 


9.5e- 
133 


454.4 




126-703 


1751 


PLAT 


PLAT/LH2 domain 


1.6e-33 


124.8 


^ 


2416 


1753 


RhojGDI 


RHO protein GDP dissociation 
inhibitor 


3.2e- 
113 


389.5 


j 


1-187 


1756 


GRAM 


GRAM domain 


7.5e-I7 


69.4 


j 


110-177 


1757 


homeobox 


Homeobox domain 


3.6e-29 


110.3 


1 


21-77 


1758 


lipocalin 


Lipocalin / cytosolic fatty-acid binding 
pr 


l.le-24 


95.4 




38-168 


1759 


ig 


immunoglobulin domain 


3.1e-13 


57.4 


2 


35- 

112:160- 

234 


1761 


oxidored 


NADH-Ubiquinone/piastoquinone 
(complex 1) 


0.078 


-128.7 


1 


16-259 


1761 


DUF250 


Domain of unknown function, DUF250 


0.45 


-85.1 


1 


162-337 


1761 


DUF6 


Integral membrane protein DUF6 


Z5 


-14.8 


1 


192-328 


1761 


secY 


eubacterial secY protein 


7,2 


-246.6 


1 


36-243 


1761 


FecCD 


FecCD transport family 


9.8 


-221.2 


1 


120-328 


1762 


Keratin B 
2 


Keratin, high sulfur B2 protein 
* , — 


3.3e-07 


32.9 


2 


11- 

113:114- 

204 


1763 


sugarjr 


Sugar (and other) transporter 


0.018 


-119,2 


1 


43-449 


1763 


oxidored 
ql 


NADM-Ubiquinone/plastoquinone 
(complex I) 


4.7 


-165.1 


1 


146-442 


1765 


VPS9 


Vacuolar sorting protein 9 (VPS9) 
domain 


4.5e-33 


123.3 


1 


380-484 


1765 


RA 

_ 


Ras association (RalGDS/AF-6) 
domain 


0.73 


-5.9 


1 


518-607 
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SEQ ID 


Model 
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E-valuc 


Score 


Repeats 


Position 


1766 


SSF 


Sodium:solute sym porter family 


5.9e- 


697.6 




106-535 








206 








1766 


oxidored_ 


NADH-Ubiquinone/plastoquinone 


4.8 


-165.2 


I 


216-521 






(complex 1) 










1766 


MIP 


Major intrinsic protein 


5.8 


-121.9 


I 


112-364 


1766 


DsbD 


Cytochrome C biogenesis protein 


8.6 


-97.3 


j 


195-425 






transmemb 










1767 


Peptidase 


. Peptidase family M3 , 


1.3e~ 


689.8 




251-701 




M3 




203 








1768 


RasGAP 


GTPase-activator protein for Ras-like 


1.3e-26 


101.9 




449-621 


1768 


PH 


PH domain 


0.012 


22.2 


.... ... 


112-236 


1768 


C2 


C2 domain 


2.6 


-10 9 


1 


749-119 


1768 


T*vmn 4*5 


Tvmnvinm 4S/70fCd nrnffMn 


4 2 


-201 9 








kd 70kd 












17fifl 

X i UO 


DMA nnl 


iJiN/\-ucpcnucni avin a. polymerase 




014 7 


— 

_j 




17fiR 
J /oo 


pirn 


rrlJU-Iinger 


o.y 




J — — 


214-273 


1 *7?A 
1 / /V 


rrm 


RJMA recognition motif. 


0.48 


5.6 


2 


238- 














323:352- 


— 












422 


1772 


ZI-Lzriz 


Zmc linger, C2H2 type 


1.6e-87 


304.2 


13 


49-71:197- 














219:225- 














247:253- 














275:281- 














303:309- 














Ol f .ITT 

33 1 :337- 




























18*7-101 




























443:449- 














471:477- 














499:505- 














527 


1772 


zf-BED 


BED zinc finger 


8.9 


-6.9 


1 


490-528 


1773 


ATP- 


ATP synthase subunit C 


5.4e-08 


40.0 


1 


62-127 




synt C 












1774 


zf-B box 


B-box zinc finger 


7.2e-l4 


59.5 


1 


90-131 


1774 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


1.6e-10 


48.4 


1 


15-58 


1774 


PHD 


PHD-fmger 


LI 


-10.3 


1 


14-61 


1774 


zf-ANl 


AN1 -like Zinc finger 


\A 


-3.2 


1 


15-53 


1774 


zf-UBRl 


Putative zinc linger in N-recognin 


7.6 


-23.6 


1 


93-146 


1775 


FH2 


Formin Homology 2 Domain 


5.4e-50 


179.5 


1 


21-455 


1775 


WH2 


WH2 motif 


0.0058 


23.3 


I 


441-456 


1777 


PHD 


PHD-finger 


8.7 


-18.6 


1 


493-522 


1779 


Branch 


Core-2/I-Branching enzyme 


3.9e-05 


-52.8 


1 


175-516 


1780 


TFIIS 


Transcription factor S-H (TFIIS) 


3.9 


-3.7 


1 


295-344 


1781 


SH3 


SH3 domain 


1.3e-13 


58.6 


1 


454-508 


1782 


filament 


Intermediate filament protein 


3.9 


-200.0 


1 


134-406 


1783 


SET 


SET domain 


2e-53 


190.9 


1 


3878-4006 


1783 


PHD 


PHD-finger 


4.7e-28 


106.6 


3 


20-71:72- 














118:147- 














200 


1^83 


HMGJjo 

X 


HMG (high mobility group) box 


0.0047 


12.3 


1 


710-773 


1783 


DC1 


DC1 domain 


8.5 


1.7 


3 


19-50:69- 














98:146-176 


1784 


zf-DHHC 


DHHC zinc finger domain 


2,4e-25 


97.7 


1 


90-154 


1785 


zf-C2H2 


Zinc finger, C2H2 type 


9.3e-14 


59.1 


4 


18-40:64- 














86:92- 
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115:178- 
200 


*1788 


ion trans 


Ion transport protein 


8.3e-l8 


72.6 


1 


438-672 


1788 


ank 


Ankyrin repeat 


L9e-06 


34.8 


2 


77- 

108:163- 
195 


1788 


Srg 


C*elegans Srg family integral 
membrane prot 


8 J 


-222.4 


I 


418-669 


1791 


ubiquitin 


Ubiquitin family 


0.0036 


24.0 


1 


124-195 


1792 


Gag^plO 


Retroviral GAG pIO protein 


4e-23 


90.2 


1 


1-89 


1798 


Ribosoma 
1 S12 


Ribosomal protein SI 2 


0.003 


-14.2 


I 


7-66 


1799 


efhatid 


EF hand 


Lle-07 


39.0 


3 


281- 
309:318- 
346:353- 
381 


1799 


Acyltransf 
erase 


Acyltransferase 


0,0001 


26.8 


1 


18-203 


1801 


zf-C2H2 


Zinc finger, C2H2 type 


3.2e-79 


276.6 


12 


160- 

182:188- 
210:216- 
238:244- 

ZOOIZ 11- 

294:300- 

377:431- 
453:459- 
481:487- 
509:515- 
537:543- 

JO J 


1801 


L1M 


L1M domain 


4.7 


-17.4 


] 




1802 


lectin c 


Lectin C-tvne domain 

* . 'WW L 1 1 1 V> ITUu VJ \J 111 CL III 


1 ,4e-25 


98 4 






1802 


lectin c 


Lectin C-type domain 


L4e-25 


98.4 


1 


143-250 


1804 


efhaiid 


EF hand 


2.5e-08 


41.1 


2 


82 


1806 


LRR 


Leucine Rich Repeat 


1 .5e-07 


38.5 


5 


130- 

151:152- 
173:174- 
195:196- 
217:221- 
243 


1807 


COX3 


Cytochrome c oxidase subunit III 


3.5 


-233.2 


1 


51-219 


1807 


CbiM 


CbiM 


9.2 


-93.3 


1 


114-249 


1807 


oxidored_ 
q5 N 


NADH-ubiquinone oxidoreductasc 
chain 4 


9.8 . 


-59.5 


1 


89-169 


1808 


Sulfotrans 
fer 


Sulfotransferase protein 


6.6 


-108.5 


1 


39-282 


1814 


Metalloph 

OS 


Calcineurin-like phosphoesterase 


0.23 


14.3 


1 


36-241 


1815 


rrm 


RNA recognition motif. 


0.67 


4.1 


1 


201-269 


1816 


filament 


Intermediate filament protein 


1.5 


-188.9 


1 


1-253 


1817 


MAGE 


MAGE family 


4.2e-17 


70.3 


2 


14* 

613:655- 
867 


1817 


Atrophin- 
1 


Atrophin-1 family 


9.5 


-684.8 


1 


4-783 


1818 


Tropomod 


Tropomodulin 


8.1 


-168.0 


1 


362-653 
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ulirt 












1819 


Hydrolase 


haloacid dehalogenase-like hydrolase 


3.4*21 


83.8 




28-282 


1820 


Fe hyd I 


Iron only hydrogenase large subunit, Ci- 


1.3e-92 


32L1 




204-516 




g_c 


te 






1 




1820 


Fe hyd_S 


Iron hydrogenase small subunit 


8.3 


-13.1 




529-576 




SU 








I 




1821 


filament 


Intermediate filament protein 


le-143 


490.8 


J\- — 

1 


55-366 


1822 


Keratin B 
2 


Keratin, high sulfur B2 protein 


0.082 


-51.3 


l 


12-174 


1824 


metalthio 


Metallothionem 


5.7e*17 


69.8 


i 


48-108 


1824 


AFP 


Insect antifreeze protein 


2.6 


14.5 




61-72 


1825 


C2 


C2 domain 


4.7e-53 


189.7 


2 


373- 














462:528- 














617 


1825 


RPH3A_e 


Rabphilin-3A effector domain 


0.035 


-127.1 


1 


1-237 




ffector 












1825 


zf-MIZ 


MIZ zinc finger 


04 


-10.5 


1 


80-111 


1825 


PHD 


PHD-finger 


0.97 


-9.7 


1 


62-108 


1825 


L1M 


LIM domain 


2.2 


-14*6 


1 


80-128 


1825 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 

£2 — i d-L v ft »/ 


3,1 


-4.0 


1 


63-105 


1828 


efhand 


EF hand 


8.4e-08 


39,4 


3 


106- 














134:142- 














170:407- 














435 


1829 


60s_ribos 


60s Acidic ribosomal protein 


7.4 


-38.2 


1 


163-291 




omal 










1831 


Toktin 


Tektin family 


1.6 


-227.3 


1 


1-150 


1833 


EGF 


EGF-like domain 


l.le-32 


122.0 


s 


26-54:57- 














85:92- 














124:156- 














187:194- 














226 


1833 


laminin E 


Laminin EGF-like (Domains III and V) 


1.6 


-5.9 


1 


29-63 




GF 










1833 


Bowman- 


Bowman-Birk serine protease inhibitor 


4.8 


-16.7 


1 


37-85 




Birkjeg 












1833 


metalthio 


Metal lothionein 


5.5 


-10.1 


1 


59-117 


1834 


7tmJ 


7 transmembrane receptor (rhodopsin 


4>8e-05 


-0.8 


1 


31-273 






family) 






* 




1834 


7tm_2 


7 transmembrane receptor (Secretin 


5.5 


-151.6 


1 


9-248 






family) 










1835 


rrm 


RNA recognition motif. 


0.05 


15.7 


1 


75-141 


1835 


Met 10 


Met-10+ like-protein 


0.065 


-100.4 


1 


333-583 


1835 


UPF0020 


Putative RNA methylase family 


0.073 


-53.8 


1 


410-566 






UPF0020 










1835 


PCMT 


Protein-L-isoaspartate(D-aspartate) O- 


0.2 


-114,1 


1 


391-543 


1835 


Methyltra 


Putative methyltransferase 


2.1 


-82.0 


1 


424-566 




nsf 4 












1835 


notch 


Notch (DSL) domain 


6.1 


-10,5 


1 


233-271 


1836 


Kelch 


Kelch motif 


3e-33 


123.9 


4 


242- 














288:290- 












* 


331:333- 














378:380- 














435 


1836 


BTB 


BTB/POZ domain 


2.3 


-15.2 


1 


1-129 


1837 


tsp^l 


Thrombospondin type 1 domain 


0.0014 


23.3 


5 


139- 














196:203- 














256:257- 



354 



WO 02/070539 



PCT/US02/05095 



SEQ ID 


Model 


Description 


E-value 


Score 


Repeats 


Position 






1 








313:320- 
370:374- 
432 


1837 


EB 


EB module 1 


6.4 


-10.5 


1 


330-399 


1838 


adh zinc 

UUII jULMu 


Zinc-hindine: dehydrogenase U 




7 

17 J. / 


i 


AAJXfLQ 


1838 


Lipase G 
DSL 


Li oase/ Ac vl hydrolase with GDSL-Iik^L 
motif 


5.5 


-90 4 


i 
i 


ifn i wo 

IVJ-l OV 


1839 


pkinase 


Protein kinase domain 


2.4e-05 


-58.0 


I 


81-333 


1841 




7(iic finder C2H2 tvne 


4.4e-30 


Hid 


7 


141 
l £ rl- 

191:197- 

247:350- 

400:406- 
434 


1841 


KRAB 


KRAB box 


5.1e-23 


89.9 




44-84 


1841 


TFI1S 


Transcription factor S-IT (TFIIS) 


2.4 


-L9 






1841 


Lenti viral 
Tat 


Lentiviral Tat protein 


4.9 


-23.0 


-j — 




1841 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


7.2 


-7*1 




352-41 1 


1841 


rubredoxi 
n 


Rubredoxin 


7.4 


-11.1 






1842 


E2F TDP 


Transcription factor E2F/dimcrisation 
partne 


1.5 


-95.1 




1 <J~L.£*J 


1843 


V ATPas 
e sub a 


V-tvne ATPase 1 1 fikDa stihnnit fflrnilv 


Q 


1258.7 




^-U O.J I 


1843 


UCR 14k 
D 


Ubiquinol-cy to chrome C reductase 
com pi 


3.2 


-40.4 


j 


6-112 


1843 


FliP 


FliP family 


4.8 


-148.9 


1 


533-661 


1843 


PsbH 


Photosystem H 10 kDa phosphoprotein 


7.1 


5.1 


j 


751-803 


1844 


aajjerme 
ases 


Amino acid permease 


5.6e-08 


-125.3 


1 


28-529 


1844 


Aajxans 


Transmembrane amino acid transporter 
pro 


0.71 


-178.9 


1 


30-408 


1844 


BPDjran 
sp_2 


Branched-chain amino acid transport 
syst 


3,3 


-116.9 


-j 


175-440 


1844 


MVIN 


Virulence factor MVIN 


3.5 


-240.9 


j 


87-515 


1844 


FecCD 


FecCD transport family 


53 


-216.0 


J 


311-521 


1844 


ion trans 


Ion transport protein 


6.2 


-11.7 




288-495 


1844 


7tm 5 


7TM chemoreceptor 


7.1 


-168.6 




238-498 


1848 


sugar_tr 


Sugar (and other) transporter 


0.0072 


-107.9 


| 


12-407 


1848 


xan_ur_pe 
rmease 


Permease family 


5.6 


-196.7 




38-359 


1848 


FecCD 


FecCD transport family 


6.3 


-217.4 




56-287 


1848 


Nucleosid 
e tran 


Nucleoside transporter 


8.1 


-162.4 


J 


82-365 


1848 


PUCC 


PUCC protein 


9.8 


-283.2 




51-403 


1849 


Filamin 


Filamin/ABP280 repeat 


2.3e-20 


81.1 




396-494 


1849 


zf-B_box 


B-box zinc finger 


9.3e-ll 
* 


49,2 


2 


130- 

176:186- 
227 


1849 


zf-C3HC4 


Zinc finger, C3HC4 type (RING finger) 


0.0016 


24.4 


1 


29-62 


1853 


ank 


Ankyrin repeat 


Ue-98 


341.3 


10 


485« 

517:518- 

550:551- 

583:584- 

616:617- 
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1 








650:651- 














683:684- 














716:717- 






i 








749:750- 






1 








780:782- 






1 








814 


1853 


pkinase 


Protefh kinase domain 


L6e-50 


181.3 


1 


22-286 


1854 


ras 


Ras family 


h4e-13 


17.8 


1 


5-194 


1855 


Acyltransf 


Acyltransferase 


3e-34 


127,2 


1 


92-2%9 




erase 












1856 


wap 


WAP-type (Whey Acidic Protein) 


0.0025 


18.7 


1 


31-72 






Tour-disulfi 










1857 


arf 


ADP-ribosylation factor family 


2.5e-ll 


24.6 


1 


7-193 


1857 


GTPJEFT 


Elongation factor Tu GTP binding 


1.6 


-82.5 


1 


19-198 




U 


domain 










1858 


zf-C2H2 


Zinc fin per C2H2 tvne 






<v 

J 


oZ- 




























132:138- 














161:364- 














386:392- 














41fi 


1858 


PHD 


PHD-finger 


7.7 


-18.1 


I 


365-400 


1859 


zf-C2H2 


Zinc finger, C2H2 type 


7.2e-20 


79.4 


4 


119- 














141:147- 














169:175- 














198:214- 














237 


1860 




BTB/POZ domain 


L7e-29 


111.4 


1 


22-126 


1860 


zf-C2H2 


Zinc finger, C2H2 type 


5.1Q-14 


60.0 


3 


373- 














395:401- 














423:429- 














452 


1860 


K tetra 


K+ channel tetramerisation domain 


0.44 


-30.2 


I 


34-124 


I860 


zf-BED 


BED zinc finger 


8.9 


-6.9 


1 


363-396 


1861 


SNF2 N 


SNF2 and others N-terminal domain 


2.1C-72 


254.0 


1 


137-473 


1861 


hel icase 
C 


Helicase conserved C-terniinal domain 


1,46-24 


95,1 


1 


559-632 


1861 


DEAD 


DEAD/DEAH box helicase 


0.26 


-59.6 


1 


124-337 


1861 


SEC-C 


SEC-C motif 


0.43 


13.8 


1 


426-446 


1862 


ras 


Ras family 


2.4e-69 


243.8 


1 


19-217 


1862 


arf 


ADP-ribosylation fkctor family 


4.1e-05 


-S3. 5 


1 


1-182 


1864 


rrm 


RNA recognition motif. 


0.053 


15.4 


1 


129-191 


1867 


T-box 


T-box 


4«3e~09 


29.9 


1 


1-92 


1870 


F=box 


F-box domain 


1.8 


15.0 


1 


124-171 


1871 


PH 


PH domain 


l.le-13 


58.9 


I 


170-273 


1872 


isodh 


Isocitrate/isopropylmalate 


1.4e- 


423.9 


1 


10-383 






dehydrogenase 


123 








1873 


ank 


Ankyrin repeat 


1.8e-08 


41.5 


2 


39-71:72- 














104 


1874 


ank 


Ankyrin repeat 


1.8e-08 


41.5 


2 


67-99:100- 














132 


1875 


PAP2 


PAP2 supeffamily 


0.0034 


13.0 


1 


1-107 


1876 


DUF51 


Protein of unknown function DUF51 


5.7e-13 


52.5 


1 


107-261 


1877 


zf-C2H2 


Zinc finger, C2H2 type 


1.5e-72 


254.5 


9 


285- 














307:313- 














335:341- 














363:369- 














391:397- | 
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Score 
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A vJltlllll 


1 

\ 












419:425- 
447:451- 
473:479- 
501:507- 
529 


£877 


KRAB 


KRAB box 


2.7e-21 


84.2 


1 


49-89 


&77 


zf-BED 


BED zinc finger 


0.74 


2.7 


1 


344-392 


1877 


TFI1S 


Transcriotion factor S-1I f 1TIIS} 


1.5 


-0.0 


1 


369-407 


1877 


GATA 


GATA zinc finger 


2.1 


-7.1 


1 


477-524 


1877 


LIM 


1 .TTvf ilfitTiiiiil 

JUliVJl Ulfllltllll 


2,5 


-15.1 


\ 




1877 


E6 


Early Protein (E6) 


5.7 


-66.7 


1 


315-402 


lo/o 


AAA 


A * I linen ■fintYiilvt no0f»o5 ntp»fl Wvth VftflMlC 






1 
1 




1878 


Sigma54_ 4 
aetivat 


$igma-54 interaction domain 


4.6 


-165.4 


1 


184^365 




cot? 


Sodium :solute $ym porter family 


H.tse- 


J /O.J 


1 




1880 


SSF 


Sodium :so!ute symporter family 


1.3e- 
198 


673-2 


1 


58-487 


1881 


perilipin 


Perilipin family 


6.7e-53 


189,2 


1 


12-369 


1882 


aminotran 
1 2 


Aminotransferase class I and II 


7,5e-07 


25.0 


1 


165-515 


1883 


DSPc 


Dual specificity phosphatase, catalytic 
doma 


2.5e-30 


114.2 


1 


54-199 


1884 


ig 


Immunoglobulin domain 


7.2e-l4 


59.5 


2 


34- 

111:155- 
229 


1 DOC 


MUM 


Mt>M^/J/5 family 


A £.a. 1 1 

4.oe^l2 


-O/.J 


i 
i 


1 A£ mi 


iboo 


iveicn 


V 1 /V l-> msvl-*-P 

rLCicn mot it 


l.Ke»oi 


11 A 1 


c 
D 


Zo/i- 

JUO.jU?- 
JJU.JJO" 

453:455- 
495:497- 
544 


1886 


BTB 


BTB/POZ domain 


5.3e-32 


119.7 




26-134 


1887 


UBA 


UBA/TS-N domain 


1.9e-05 


31.5 


J 


726-765 


1887 


rrm 


RNA recognition motif 


0.00026 


27,8 




1342-1407 


1887 


MARCKS 


MARCKS family 


43 


-90.8 




270-571 


1888 


UBA 


UBA/TS-N domain 


1.9e-05 


31.5 




726-765 


1888 


rrm 


RNA recognition motif 


0.00026 


27.8 


j 


1303-1368 


1888 


MARCKS 


MARCKS family 


4.3 


-90.8 




270-571 


1889 


UBA 


UBA/TS-N domain 


L9e-05 


31.5 


J , 


726-765 


1889 


rrnt 


RNA recognition motif 


0.00026 


27.8 




1245-1310 


1889 


MARCKS 


MARCKS family 


4.3 


-90.8 




270-571 


1889 


Atrophin* 
1 


Atrophin-1 family 


6.9 


-676.1 


i — > 


237-986 


1890 


Na_H_Ex 
changer 


Sodium/hydrogen exchanger family 


2.4e-18 


74.4 




119-520 


1890 


Na_CaJB 

X 


Sodium/calcium exchanger protein 


3.1 


-43.6 




270-384 


1890 


7tm 5 


7TM chemoreceptor 


3.8 


-163.7 




259-503 


1890 * 


Abi 


CAAX amino terminal protease family 


5.9 


-25.8 




309-401 


1890 


oxidored 

q» 


NADH-Ubiquinone/plastoquinone 


7.9 


-169.7 




251-502 


1891 


Acyl- 
CoAJiydr 

0 


Cytosolic long-chain acyl-CoA thioeste 


8.9C-72 


251.9 


2 


26- 

168:200- 
336 


1891 


tRNA ant 


OB-fold nucleic acid binding domain 


3.5 


5.8 


\ 


230-307 
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i 












1892 


zf-DHHC 


DHHC zinc finger domain 


8.6e-18 


72.5 


1 


145-207 


1894 


SPRY 


SPRY domain 


4.5e-ll 


50.2 


1 


353-474 


1894 


fn3 


Fibronectin type III domain 


0.054 


17.6 


I 


165-258 


1895 


dsrm 


Double-gtranded RNA binding motif 
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UBIQUITIN-CONJUGATING 
ENZYME 


£ 2" S 
o o * 

ggi 

o o c| 

lis 

U o u 

Ilii 

c/oo>>: 

H H H N 


UBIQUITTN CONJUGATION UBC2; 
UBIQUITIN CONJUGATION, 
UBIQUITIN-CONJUGATING 
ENZYME 


LIGASE UBIQUITIH UBIQUITIN- 
CONJUGATING ENZYME, YEAST 


LIGASE UBIQUITIN, UBIQUITIN- 
CONJUGATING ENZYME, YEAST 


UBIQUITIN CONJUGATION UBCl; 
UBIQUITIN CONJUGATION, LIGASE 


UBIQUITIN CONJUGATION UBCl; 
UBIQUITIN CONJUGATION, LIGASE 




UBIQUITIN CONJUGATION UBC2; 
UBIQUITIN CONJUGATION, 
UBIQUITIN-CONJUGATING 
ENZYME 


UBIQUITIN CONJUGATION UBC2; 
UBIQUITIN CONJUGATION, 
UBIQUITIN-CONJUGATING 
ENZYME 


UBIQUITTN CONJUGATION UBC2; 
UBIQUITTN CONJUGATION, 
UBIQUITIN-CONJUGATING 
ENZYME 


LIGASE UBIQUITTN, UBIQUITIN- 
CONJUGATING ENZYME. YEAST 


LIGASE UBIQUITIN, UBIQUITIN- 1 


Coumpound 




UBIQUITIN-CONJUGATING 
ENZYME RAD6; CHAIN: A, B, C; 
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UBIQUITIN CONJUGATING 
ENZYME; CHAIN: A; 


j UBIQUITIN CONJUGATING 
ENZYME; CHAIN: A; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: NULL; 


UBIQUITIN CONJUGATING 
: ENZYME; CHAIN: NULL; 




! UBIQUITIN-CONJUGATING 
1 ENZYME RAD6; CHAIN: A, B, C; 


UBIQUITIN-CONJUGATING 
ENZYME RAD6; CHAIN; A, B, C; 
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ENZYME RAD6; CHAIN: A, B, C; 


UBIQUITIN CONJUGATING 
ENZYME; CHAIN: A; 
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MOTOR 2 PROTEIN, ATPASE, P- 
LOOP, MICROTUBULE BINDING 
PROTEIN 


CONTRACTILE PROTEIN 
CONTRACTILE PROTEIN, KAR3, 
KINESIN-RELATED PROTEIN, 
MOTOR 2 PROTEIN, ATPASE, P- 
LOOP, MICROTUBULE BINDING 
PROTEIN 


MOTOR PROTEIN MOTOR PROTEIN, 
CYTOSKELETON 


MOTOR PROTEIN MOTOR PROTEIN, 
CYTOSKELETON 




COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN IIA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITORS 


COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 
AUTOPROTHROMBIN HA; 
HYDROLASE, SERINE 
PROTEINASE), PLASMA CALCIUM 
BINDING, 2 GLYCOPROTEIN, 
COMPLEX (BLOOD 
COAGULATION/INHIBITOR) 


SURFACE PROTEIN MEROZOITE 
SURFACE ANTIGEN 1, MAJOR 
BLOOD-STAGE EGF-LIKE DOMAIN, 
EXTRACELLULAR, MODULAR 
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KINESIN-LIKE PROTEIN KAR3; 
CHAIN: NULL; 


; KJNESIN HEAVY CHAIN; 
CHAIN: A, B, C, D; 


KINESIN HEAVY CHAIN; 
CHAIN: A,fi,C,D; 




ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO-MAI; 
CHAIN: P; 


ACTIVATED PROTEIN C; 
CHAIN: C, L; D-PHE-PRO-MAI; 
CHAIN: P; 


MEROZOITE SURFACE 
PROTEIN 1; CHAIN: A; 
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PROTEIN, SURFACE 2 ANTIGEN, 
MALARIA VACCINE COMPONENT, 
SURFACE PROTEIN 


GLYCOPROTEIN MEMBRANE . 
COFACTOR PROTEIN (MCP); VIRUS 
RECEPTOR, COMPLEMENT 
COFACTOR, SHORT CONSENSUS 
REPEAT, 2 SCR, MEASLES VIRUS, 
GLYCOPROTEIN 


APOPTOSIS TRAIL; DR5; LIGAND- 
RECEPTOR COMPLEX, TRIMERIC 
JELLY-ROLL, TOF-R 2 
SUPERFAMILY, APOPTOSIS 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, j 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


BLOOD COAGULATION, SERINE 
PROTEASE, COMPLEX, CO-FACTOR, 
2 RECEPTOR ENZYME, INHIBITOR, 
GLA, EGF, 3 COMPLEX (SERINE 
PROTEASE/COFACTOR/LIGAND) 


! HYDROLASE/HYDROLASE ! 
INHIBITOR PROTEIN-PEPTIDE 
COMPLEX 
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TNF-RELATED APOPTOSIS 
INDUCING LIGAND; CHAIN: B; 
DEATH RECEPTOR 5; CHAIN: 

A; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE | 
(DFFRCMK) WITH CHAIN: C; 1 


BLOOD COAGULATION 
FACTOR Vn A; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T, U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C; 


BLOOD COAGULATION 
FACTOR VIIA; CHAIN: L, H; 
SOLUBLE TISSUE FACTOR; 
CHAIN: T 3 U; D-PHE-PHE-ARG- 
CHLOROMETHYLKETONE 
(DFFRCMK) WITH CHAIN: C; 


DES-GLA FACTOR VIIA 
(HEAVY CHAIN); CHAIN: H, I; 
DES-GLA FACTOR VIIA (LIGHT 
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BLOOD COAGULATION FACTOR 
STUART FACTOR; BLOOD 
COAGULATION FACTOR, SERINE 
PROTEINASE, EPIDERMAL 2 
GROWTH FACTOR LIKE DOMAIN 
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CI u u 

IP 

m o ffl 


COMPLEX 

(TRANSMEMBRANE/GLYCOPROTEI 
N) MHC GLYCOPROTEIN, COMPLEX 
(TRANSMEMBRANE/GLYCOPROTEI 
N) 


COMPLEX (MHC 
PROTEIN/ANTIGEN) DRA, DRB 1 
01010; COMPLEX (MHC 
PROTEIN/ANTIGEN), 
HISTOCOMPATIBILITY ANTIGEN 


COMPLEX (MHC 
PROTEIN/ANTIGEN) DRA, DRB I 
01 .010; COMPLEX (MHC 
PROTEIN/ANTIGEN), 
HISTOCOMPATIBILITY ANTIGEN 


IMMUNE SYSTEM HLA-DR2, 
MYELIN BASIC PROTEIN, 
MULTIPLE SCLEROSIS, 2 
AUTOIMMUNITY, IMMUNE SYSTEM 


I IMMUNE SYSTEM HLA-DR2, | 
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BLOOD COAGULATION 
FACTOR XA; CHAIN: L, C; 
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HLA-DR3; CHAIN: A, B; CLIP; 
CHAIN: C; 


! HLA-DR3; CHAIN: A, B; CLIP; 
, CHAIN: C; 
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HLA-DRl CLASS II 
HISTOCOMPATIBILITY 
PROTEIN; CHAIN: A, B, D, E, G, 
H, J, K; HLA-A2; CHAIN: C, F, 1, 
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MYELIN BASIC PROTEIN, 
MULTIPLE SCLEROSIS, 2 
AUTOIMMUNITY, IMMUNE SYSTEM 


IMMUNE SYSTEM HLA-DR2, 
MYELIN BASIC PROTEIN, 
MULTIPLE SCLEROSIS, 2 
AUTOIMMUNITY, IMMUNE SYSTEM 


IMMUNE S YSTEM HLA-DR2, 
MYELIN BASIC PROTEIN, 
MULTIPLE SCLEROSIS, 2 
AUTOIMMUNITY, IMMUNE SYSTEM 


IMMUNE SYSTEM MHC CLASS II 
DR2A 


IMMUNE SYSTEM MHC CLASS n 
DR2A 


IMMUNE SYSTEM RING6, HLA- 
DMA; RING7, HLA-DMB; 
HISTOCOMPATIBILITY PROTEIN, 
IMMUNE S YSTEM 


I HISTOCOMPATIBILITY ANTIGEN I- 
AK HISTOCOMPATIBILITY 
ANTIGEN, MHC, PEPTIDE COMPLEX 


Coiunpound 


DR2; CHAIN: B, E; HLA-DR2; 
CHAIN: C,F; 


HLA-DR2: CHAIN: A, D; HLA- 
DR2; CHAIN: B, E; HLA-DR2; 
CHAIN: C, F; 


HLA-DR2; CHAIN: A, D; HLA- 
DR2; CHAIN: B, E; HLA-DR2; 
CHAIN: C, F; 


MAJOR HISTQCQMP ATTB ILITY 
■ COMPLEX ALPHA CHAIN; 
CHAIN: A, D; MAJOR 
HISTOCOMPATIBILITY 
COMPLEX BETA CHAIN; 
CHAIN: B, E; MYELIN BASIC 
PROTEIN; CHAIN: C, F; j 


MAJOR HISTOCOMPATIBILITY 
COMPLEX ALPHA CHAIN; 
CHAIN: A, D; MAJOR 
HISTOCOMPATIBILrrY 
COMPLEX BETA CHAIN; 
CHAIN: B, E; MYELIN BASIC 
PROTEIN; CHAIN: C, F; 


CLASS II 

HISTOCOMPATIBILITY 
ANTIGEN, M ALPHA CHAIN: A; 
CLASS II 

HISTOCOMPATIBILITY 
ANTIGEN, M BETA CHAIN: B; 


MHC CLASS n I-AK; CHAIN: A, 
B, P; HEN EGGWHITE 
LYSOZYME PEPTIDE 
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I CHLAMYDOMONAS, FLAGELLA | 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


TRANSCRIPTION RNAIP; RANGAP; 
GTPASE-ACTIVATING PROTEIN 
FOR SPII, GTPASE-ACTIVATING 
PROTEIN, GAP, RNAIP, RANGAP, 
LRR, LEUCINE- 2 RICH REPEAT 
PROTEIN, TWINNING, 
HEMIHEDRAL TWINNING, 3 
MEROHEDRAL TWINNING, 
MEROHEDRY 




TRANSFERASE CASEIN KINASE 
BETA SUBUNIT (1-1 82), SER/THR 
PROTEIN KINASE, 2 ZN FINGER 


: TRANSFERASE CASEIN KINASE 
, BETA SUBUNIT (1-182), SER/THR 
| PROTEIN KINASE, 2 ZN FINGER 




OXIDOREDUCTASE 
FERROCYTOCHROME CVOXYGEN 
OXIDOREDUCTASE; 
OXIDOREDUCTASE, 
CYTOCHROME(C)-OXYGEN s 
CYTOCHROME C 2 OXIDASE 


1 


. Q 

5 u 
w S 

&i H 2 S 
h P eJ w 


3 
| 




HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


! GTPASE-ACTIVATING 
i PROTEIN RNA1 SCHPO; 
CHAIN: A, B; 
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DNA-DIRECTED RNA 
POLYMERASE; CHAIN: A, B; ! 
DNA-DIRECTED RNA 
POLYMERASE; CHAIN: C; DNA- 
DIRECTED RNA POLYMERASE; 
CHAIN: D; DNA-Df RE CTED 
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TRANSCRIPTION REGULATION 
PROTO-ONCOGENE, NUCLEAR 
BODIES (PODS), LEUKEMIA, 2 
TRANSCRIPTION REGULATION 
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LIGASE CBL } UBCH7, ZAP-70, E2, 
UBIQUlTiN, E3 ? 

PHOSPHORYLATION, 2 TYROSINE 
KINASE, UBIQUITINATION, 
PROTEIN DEGRADATION, 


ZINC-BINDING PROTEIN ZINC- 
BINDING PROTEIN, XNF7, BBOX, 
DEVELOPMENT, 3 MID-BLASTULA- 
TRANSITION 


METAL BINDING PROTEIN RING 
FINGER PROTEIN MATl; RING 
FINGER (C3HC4) 
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RNA POLYMERASE; CHAIN: E; | 




TRANSCRIPTION FACTOR 
PML; CHAIN: NULL; 


VIRUS EQUINE HERPES VIRUS- 
1 (C3HC4, OR RING DOM AIN) 
ICHC 3 (NMR, 1 STRUCTURE) 
ICHC 4 


VIRUS EQUINE HERPES VIRUS- 
I (C3HC4, OR RING DOMAIN) 
ICHC 3 (NMR, 1 STRUCTURE) 
ICHC 4 


SIGNAL TRANSDUCTION 
PROTEIN CBL; CHAIN: A; ZAP- 
70 PEPTIDE; CHAIN: B; 
UBIQUITTN-CONJUGAUNG 
ENZYME E12-18 KDA UBCH7; 
CHAIN: C; 


SIGNAL TRANSDUCTION 
PROTEIN CBL; CHAIN: A; ZAP- 
70 PEPTIDE; CHAIN: B; 
UBIQU1HN-CONJUGATING ! 
ENZYME E12-1 8 KDA UBCH7; 
! CHAIN: C; 


; NUCLEAR FACTOR XNF7; 
CHAIN: NULL; 


CDK-ACTIVAUNG KINASE 
ASSEMBLY FACTOR MATl; 
CHAIN: A; 


| RAGl; CHAIN: NULL; 
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GTPASE/NUCLEAR PROTEIN) 
COMPLEX (SMALL 
GTPASE/NUCLEAR PROTEIN), 
SMALL GTPASE, 2 NUCLEAR 
TRANSPORT 


COMPLEX (GTP- 

BINDING/EFFECTOR) RAS-RELATED 
PROTEIN RAB3A; COMPLEX (GTP- 
BINDING/EFFECTOR), G PROTEIN; 
EFFECTOR, RABCDR, 2 SYNAPTIC 
EXOCYTOSIS, RAB PROTEIN, 
RAB3A, RABPHILIN 


COMPLEX (GTP- 

BINDING/EFFECTOR) RAS-RELATED 
PROTEIN RAB3A; COMPLEX (GTP- 
BINDING/EFFECTOR), G PROTEIN, 
EFFECTOR, RABCDR, 2 SYNAPTIC 
EXOCYTOSIS, RAB PROTEIN, 
RAB3A, RABPHILIN 


! HYDROLASE CDC42/CDC42GAP; 

CDC42/CDC42GAP; TRANSITION 
: STATE, G-PROTEIN, GAP, CDC42, 
: ALF3., HYDROLASE 


! HYDROLASE G PROTEIN, 
1 VESICULAR TRAFFICKING, GTP 
HYDROLYSIS, RAB 2 PROTEIN, 
NEUROTRANSMITTER RELEASE, 
HYDROLASE 


HYDROLASE G PROTEIN, 
VESICULAR TRAFFICKING, GTP 
HYDROLYSIS, RAB 2 PROTEIN, 
NEUROTRANSMITTER RELEASE, 
HYDROLASE 
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PORE COMPLEX PROTEIN 
NUP358; CHAIN: B, D; 


RAB -3 A; CHAIN: A; 
j RABPHILIN-3A; CHAIN: B; 


RAB-3A; CHAIN: A; 
RABPHILIN-3A; CHAIN: B; 


GTP BINDING PROTEIN (025K); 
CHAIN: A; GTPASE 
ACTIVATING PROTEIN (RHG); 
CHAIN: B; 


RAB3A; CHAIN; A; 


RAB3A; CHAIN: A; 
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GLYCOGEN METABOLISM, 
TRANSFERASE, 

SERINE/THRE ONINE-PRO TEEN, 2 
KINASE, ATP-BINDING, 
CALMODULIN-BINDING 


SERINE KINASE SERINE KINASE, 
TTTIN, MUSCLE, AUTOINHIBITION 


SERINE KINASE SERINE KINASE, 
TITTN. MUSCLE. AUTOINHffimON 




INSECT IMMUNITY INSECT 
IMMUNITY, LPS-BINDING, 
HOMOPHILIC ADHESION 


CELL ADHESION NEURAL CELL 
ADHESION 


GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR FGF, FGFR, 
IMMUNO GLOBULIN-LIKE, SIGNAL 
TRANSDUCTION, 2 DIMERIZATION, 
GROWTH FACTOR/GROWTH 
FACTOR RECEPTOR 
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NEURAL CELL ADHESION 
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IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


CELL ADHESION NCAM; NCAM, 
IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN 


IMMUNE SYSTEM FC-EPSILON RI- 
ALPHA; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BIND3NG 2 PROTEIN 


IMMUNE SYSTEM FC-EPSILON RI- 
ALPHA; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTB^eS^ ~ ' 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FC (EPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN,, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 
ANTIBODY, IGE-FC 


IMMUNE SYSTEM HIGH AFFINITY 
IGE-FC RECEPTOR, FQEPSILON) 
IGE-FC; IMMUNOGLOBULIN FOLD, 
GLYCOPROTEIN, RECEPTOR, IGE- 
BINDING 2 PROTEIN, IGE 
ANTIBODY, IGE-FC 


IMMUNE SYSTEM, MEMBRANE 
PROTEIN CD32; FC RECEPTOR, 
! IMMUNOGLOULIN, LEUKOCYTE,, 
CD32 


IMMUNE SYSTEM, MEMBRANE 
PROTEIN CD32; FC RECEPTOR, ] 
IMMUNO GLOULIN, LEUKOCYTE, 
CD32 


IMMUNE SYSTEM RECEPTOR BETA : 
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NEURAL CELL ADHESION 
MOLECULE; CHAIN: A, B, C, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; IG 

i EPSILON CHAIN C REGION; 

\ CHAIN: B, D; 


HIGH AFFINITY 
IMMUNOGLOBULIN EPSILON 
RECEPTOR CHAIN: A; IG 
EPSILON CHAIN C REGION; 
CHAIN: B, D; 


FC RECEPTOR 
FC(GAMMA)REA; CHAIN: A; 


FC RECEPTOR 
FC(GAMMA)RIIA; CHAIN: A; 
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HYDROLASE C2 DOMAIN, 
PHOSPHOT1DYLINOSITOL, 
PHOSPHOTASE. HYDROLASE 




STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEEN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


ENDOCYTOSIS/EXOCYTOSIS 
NSECI; PROTEIN-PROTEIN ! 
COMPLEX. MULTI-SUBUNIT 


COMPLEX (DNA-BINDTNG 
PROTEIN/DNA) TUS; TER; DNA- 
BINDING, DNA REPLICATION, 2 
COMPLEX (DNA-BINDING 
PROTEIN/DNA) 


MEMBRANE PROTEIN FOUR HELIX 
BUNDLE, ALPHA HELIX 


MEMBRANE PROTEIN FOUR HELIX 
BUNDLE. ALPHA HELIX 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL. 
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PHOSPHOINOSITIDE 
PHOSPHOTASE PTEN; CHAIN: 
A; 




ALPHA SPECTRIN; CHAIN: A, 
B, C; 


ALPHA SPECTRIN; CHAIN: A, 
B, C; 


| ALPHA SPECTRIN; CHAIN: A, 
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S YNTAXIN BINDING PROTEIN 
1; CHAIN: A; S YNTAXIN 1A; 
CHAIN; B; 


REPLICATION TERMINATOR 
PROTEIN; CHAIN: A; DNA 
16MER DUPLEX REPLICATION 
TERMINATOR; CHAIN: B, C; 


SSOl PROTEIN; CHAIN: A; 


SSOl PROTEIN; CHAIN: A; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTTNIN 2; CHAIN: A; 
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CONTRACTILE PROTEIN 












TRANSMEMBRANE PROTEIN 
COLICIN, BACTERIOCIN, ION 
CHANNEL FORMATION, 
TRANSMEMBRANE 2 PROTEIN i 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 

STRUCTURAL PROTEIN 


STRUCTURAL PROTEIN TWO 
REPEATS OF SPECTRIN, ALPHA 
HELICAL LINKER REGION, 2 2 
TANDEM 3-HELIX COILED-COILS, 
STRUCTURAL PROTEIN 
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CHAPERONE ARCHAEAL PROTEIN 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PftMOW* ~ ' 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED com 
CONTRACTILE PROTEIN 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTILE PROTEIN 


CONTRACTILE PROTEIN TRIPLE- 
HELIX COILED COIL, 
CONTRACTTLE PROTEIN 




COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 


COMPLEX (ZINC FINGER/DNA) 
COMPLEX (ZINC FINGER/DNA), 
ZINC FINGER, DNA-BINDING 
PROTEIN 
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HUMAN SKELETAL MUSCLE 
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HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTININ 2; CHAIN: A; 


HUMAN SKELETAL MUSCLE 
ALPHA-ACTIMN 2; CHAIN: As; 




QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C: 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX 
OLIGONUCLEOTIDE BINDING 
SITE: CHAIN: B, C: 


QGSR ZINC FINGER PEPTIDE; 
CHAIN: A; DUPLEX j 
OLIGONUCLEOTIDE BINDING 
SITE; CHAIN: B, C; 
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PREFOLDIN; CHAIN: 
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c: 
o 
<& 

I 
§ 

OS 
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INTERACTION, PROTEIN DESIGN, 2 1 
CRYSTAL STRUCTURE, COMPLEX { 
(ZINC FINGER/DNA) ! 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN, 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC 
FINGER, PROTEIN-DNA 
INTERACTION, PROTEIN DESIGN* 2 
CRYSTAL STRUCTURE, COMPLEX 
(ZINC FINGER/DNA) 


COMPLEX (ZINC FINGER/DNA) ZINC | 


Coum pound 


PROTEIN; CHAIN: Q F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZJNC FINGER 
PROTEIN; CHAIN: C } F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D,E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZJNC FINGER 
PROTEIN; CHAIN: C, F, G; 


DNA; CHAIN: A, B, D, E; 
CONSENSUS ZINC FINGER 
PROTEIN; CHAIN: C, F, G; 


w 

< 


SEQFOL | 
D store j 
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1 

C5 1 

a I 

1 

s 

g 

PL, 


FINGER PROTEIN; DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl , ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION^ COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG I; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION 
REGULATION/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 
(TRANSCRIPTION 
REGULATION/DNA) 


COMPLEX (TRANSCRIPTION [ 
REGULATJON/DNA) YING-YANG 1; 
TRANSCRIPTION INITIATION, 
INITIATOR ELEMENT, YYl, ZINC 2 
FINGER PROTEIN, DNA-PROTEIN 
RECOGNITION, 3 COMPLEX 


IS 

a 
g 

a 

a 




YY 1 ; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
CHAIN: A,B; 


YYl ; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
INITIATOR ELEMENT DNA; 
-CHAIN: A, B; 


YYl; CHAIN: C; ADENO- 
ASSOCIATED VIRUS P5 
! INITIATOR ELEMENT DNA; 
CHAIN: A, B; 
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SEQ 
ID 


Hit ID 


Speicfcs 


Description 


S score 


Percent 
identity 


1535 


gi 104389 
86 


Homo sapiens 


cDNA: FLJ22557 fis, cl&ie 
HS101483. % 


1405 


100 


1535 


gil52145 
57 


Mus musculus 


RIKEN cDNA 2410042lfcl 
gene I) 


1369 


86 


1536 


gil22749 
33 


Homo sapiens 


mRNA for alanine:glyox|late 
aminotransferase 2 homoffig 1, 
splice form 1 (AGXT2L1 gene). 


2018 


100 


1536 


gil28367 
24 


Mus musculus 


putative 


1689 


83 


1536 


gil58596 
90 


Homo sapiens 


unnamed protein product 


1189 


66 


1537 


gil40178 

4- / 


Homo sapiens 


mRNA forKIAA1815 protein, 


2117 


too 


1537 


gil23141 
j j 


Homo sapiens 


Human DNA sequence from 
clone RP1 1-207C16 on 
chromosome 9p23-24.3, 
Contains the 3' end of the gene 
for a novel protein similar to C. 
etegans R06F6.8 (Sw;Q094l7) 
(contains KIAA1432X the 3' end 
of the gene for a novel protein 
similar to predicted yeast, plant 
and worm proteins, ESTs, STSs 
and GSSs, complete sequence. 


2117 


100 


1537 


gil04399 
48 


Homo sapiens 


cDNA; FLJ23309 fis, clone 
HEP11618, 


1725 


99 


1538 


gU04371 
87 


Homo sapiens 


cDNA; FLJ2lI44fis f clone 
CAS07955. 


1919 


99 


1538 


AAB953 
60 


Homo sapiens 


HEU- Human protein sequence 
SEQIDNO:17664. 


1912 


99 


1538 


gil28518 
10 


Mus musculus 


putative 


1672 


84 


1539 


gi332703 
6 


Homo sapiens 


mRNA for KIAA061 1 protein, 
partial cds. 


4702 


100 


1539 


gi!39053 
02 


Mus musculus 


Similar to ATPase, class II, type 
9A 


3961 


98 


1539 


gi643496 
8 


Mus musculus 


putative E1-E2 ATPase 


3942 


98 


1540 


AAB939 
76 


Homo sapiens 


HELI- Human protein sequence 
SEQIDNO:1405L 


2580 


100 


1540 


gil04338 
68 


Homo sapiens 


cDNA FLJ12401 fis f clone 
MAMMA1002796. 


2580 


100 


1540 


AAM698 
00 


Homo sapiens 


MOLE- Human bone marrow 
expressed probe encoded protein 
SEQ ED NO; 30106, 


1108 


100 


1541 


AAB949 
01 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 16290. 


1317 


99, 


1541 


gi 104327 
16 


Homo sapiens 


cDNAFLJH457fis, clone 
HEMBA1001522. 


1317 


99 


1541 


gi 128531 
91 


Mus musculus 


putative 

.* 


887 


62 


1542 


AAB735 
07 


Homo sapiens 


INCY- Human transferase 
HTFS44, SEQDDNO:14. 


1698 


99 


1542 


gi 165524 
96 


Homo sapiens 


cDNA FLJ32390 lis, clone 
SKMUS1000177, weakly 
similar to PROTEIN-L- 
ISOASPARTATE 0- 


1698 


99 
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SEQ 
ID 


Hit ID 


Spcicies 


Description 


S score 


Percent 
identity 








MEffe Y LTRANSFER ASE (EC 
2.1.1.27). 






1542 


gil 13231 
92 


Homo sapiens 


Humafe DNA sequence from 
clone WP5-1022E24 on 
chromosome 20 Contains the 3' 
end o$heOPRLl gene 
encoding Opiate receptor-like 1 
protein, the GPR8 gene encoding 
a G protein-coupled receptor, the 
KIAA0835 gene encoding a 
protein similar to the myelin 
transcription factor 1 (MYT1), a 
novel gene, 7 CpG islands, 
ESTs, STSs and GSSs, complete 
sequence. 


1156 


69 


1543 


AAB946 
44 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15536, 


1941 


100 


1543 


gil 46258 
75 


Homo sapiens 


mRNA For putative autophagy- 
related cysteine endopeptidase 
(AUTLlgene). 


1941 


100 


1543 


gil40426 
98 


Homo sapiens 


cDNAFLJ14867fis, clone 
PLACE1002319. 


1941 


100 


1544 


gi996720 
4 


Macaca 
fascicularis 


hypothetical protein 


2621 


100 


1544 


AAB883 
51 


Homo sapiens 


HELI- Human membrane or 
secretory protein clone 
PSEC0076. 


2616 


99 


1544 


gil 42725 
56 


Homo sapiens 


unnamed protein product 


2616 


99 


1545 


gil 20054 
29 


Homo sapiens 


homeobox-containing 
transcripton factor HOXD 1 
(HOXD1) mRNA, complete cds. 


1726 


100 


1545 


gil 10956 
18 


Homo sapiens 


HOXD1 protein (HOXD1) 
gene, complete cds. 


1726 


100 


1545 


gil 56802 
45 


Homo sapiens 


homeo box Dt, clone 
MGC:23144 IMAGE:4869019, 
mRNA, complete cds. 


1718 


99 


1546 


AAB953 
66 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 17677. 


2511 


100 


1546 


gil 40425 
38 


Homo sapiens 


cDNA FLJ14773fis, clone 
NT2RP3004566, weakly similar 
to ZINC FINGER PROTEIN 84. 


2511 


100 


1546 


gil 04386 
30 


Homo sapiens 


cDNA: FLJ22301 fis, clone 
HRC04777. 


2511 


100 


1547 


gil 55596 
28 


Homo sapiens 


mitochondrial ribosomal protein 
S5> clone MGC;20735 
IMAGE:4561399 t mRNA, 
complete cds. 


2262 


99 


1547 


giI36208 
81 


Homo sapiens 


MRPS5 mRNA for 
mitochondrial ribosomal protein 
S5j complex cds. 


2262 


99 


1547 


gil 36208 
83 


Mus musculus 


mitochondrial ribosomal protein 
S5 


1821 


79 


1548 


gil 26979 
11 


Homo sapiens 


mRNA for KIAA1683 protein, 
partial cds. 


1772 


72 


1548 


gil 20532 
39 


Homo sapiens 


mRNA; cDNA DKFZp4340194 
(from clone DKFZp4340194); 
complete cds. 


1690 


49 



201 



WO 02/070539 
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SEQ 
ID 


Hit ID 


Spcicics 


Description 


S score 


Percent 
identity 


1548 


~faui74 

r 


Homo sapiens 


HUMA- Novel signal 
transduction pathway protein* 
Seq ID 1 049. 


1659 


LOO 


1549 


|l04381 


Homo sapiens 


cDNA: FLJ21945 fis, clone 
HEP04702. 


3738 


99 


1549 


#104419 
56 


Homo sapiens 


clone PP384 unknown mKNA, 


1196 


99 


1549 


AAU222 
52 


Homo sapiens 


HUM A- Human cardiovascular 
system antigen polypeptide SEQ 
ID No 1026, 


480 


98 


1550 


gi!28524 
81 


Mus museums 


putative 


1049 


56 


1550 


AAB189 
66 


Homo sapiens 


INCY- Amino acid sequence of 
a human transmembrane protein. 


922 


89 


1550 


AAB427 
81 


Homo sapiens 


CURA- Human ORFX 
ORF2545 polypeptide sequence 
SEQIDNO:5090. 


513 


100 


1551 


gi63304O 
1 


Homo sapiens 


mRNA for KIAA1 199 protein, 
partial cds. 


2521 


97 


1551 


AAY257 
93 


Homo sapiens 


HUMA- Human secreted protein 
fragment encoded from gene 12 T 


1919 


96 


1551 


gi851818 
8 


Homo sapiens 


mRNA Full length insert cDNA 
clone EUROIMAGE 2155535. 


1352 


95 


1552 


AAB944 
17 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 1 501 6, 


1027 


99 


1552 


gil04348 
35 


Homo sapiens 


cDNAFLJI3018fis, clone 
NT2RP3000685. 


1027 


99 


1552 


gil28375 
67 


Mus musculus 


putative 


972 


92 


1553 


gU59294 
11 


Homo sapiens 


clone IMAGE:4040789, 
mRNA, partial cds. 


1369 


100 


1553 


AAG034 
90 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 7571. 


450 


96 


1553 


gi 128458 
28 


Mus musculus 


putative 


376 


68 


1554 


gi998895 
8 


Homo sapiens 


F-box prolein FBX30 mRNA, 
complete cds. 


1211 


100 


1554 


gil40437 
44 


Homo sapiens 


Similar to F-box only protein 6, 
clone MGC: 14140 
IMAGE:4054414, mRNA, 
complete cds. 


1211 


100 


1554 


AAB429 
89 


Homo sapiens 


CURA- Human ORFX 
ORF2753 polypeptide sequence 
SEQ IDNO:5506. 


675 


99 


1555 


gi938026 


Homo sapiens 


Human mRNA for RanBPl 
(Ran-bfnding protein 1)> 
complete cds. 


916 


100 

- 


1555 


AAB566 
19 


Homo sapiens 


ROSE/ Human prostate cancer 
antigen protein sequence SEQ 
IDNO:1197. 


904 


99 


1555 


gi620<*83 


Homo sapiens 


H.sapiens mRNA for RanBPl. 


904 


99 


1556 


gil33832 
65 


Homo sapiens 


mRNA for act in related protein, 
complete cds. 


1962 


100 


1556 


gil65500 
55 


Homo sapiens 


cDNA FLJ30784 fis, clone 
FEBRA2000881, moderately 
similar to ACT1N 6. 


1950 


99 


1556 


gi 139383 


Homo sapiens 


clone MGC: 15664 


1949 


99 



202 
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SEQ 
ID 


Hit ID 


Speicies 


Description 


S score 


Percent 
identity 




19 




IMAGE:3349184, mRNA, 
complete cds. 






3557 


gi623545 


Oryctolagus 
cuniculus 


sarcopJasmic reticulum 
glycoprotein 


2366 


96 


1557 


gil64861 


Oryctolagus 
cuniculus 


sarcolumenin precursor 


2307 


97 


1557 


gi496325 


Gallus gallus 


53 kDa glycoprotein 


2160 


87 


1558 


AAB946 
41 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 15526. 


1379 


98 


1558 


AAG644 
03 


Homo sapiens 


SHAN- Human paneth cell 
enhanced expression-like 
protein. 


1379 


98 


1558 


AAM940 
28 


Homo sapiens 


HELI- Human stomach cancer 
expressed polypeptide SEQ ID 
NO 126. 


1379 


98 


1559 


gi874531 
5 


Homo sapiens 


putative GTP-binding protein 
(GTPBP2) mRNA, partial cds. 


2742 


99 


1559 


gii35610 
07 


Homo sapiens 


Human DNA sequence from 
clone RPl 1-22124 on 
chromosome 6 Contains the 3' 
part of the PQLH gene for DNA 
directed polymerase eta and the 
GTPBP2 gene tor GTP binding 
protein 2, ESTs, STSs, GSSs and 
a CpG island, complete 
sequence. 


2742 


99 


1559 


gi874531 
7 


Mus muscuius 


putative GTP-binding protein 


2731 


99 


1560 


AAG026 
53 


Homo sapiens 


GEST Human secreted protein, 
SEQ ID NO: 6734. 


425 


100 


1560 


gi 150804 
59 


Homo sapiens 


clone MGC9017 
IMAGE;3860059, mRNA, 
complete cds. 


425 


100 


1560 


gi233792 
0 


Homo sapiens 


Human syntaxin 7 mRNA, 
complete cds. 


421 


98 


1561 


AAB944 
68 


Homo sapiens 


HEU- Human protein sequence 
SEQ ID NO: 15 128. 


4375 


99 


1561 


gil04349 
44 


Homo sapiens 


cDNAFLJ13089fis, clone 
NT2RP3002108, weakly similar 
to DEC I PROTEIN. 


4375 


99 


1561 


gi730058 
1 


Drosophila 
melanogaster 


CG4845 gene product 


1083 


31 


1562 


AAU121 
77 


Homo sapiens 


GETH Human PRO305 
polypeptide sequence. 


226 


100 


1562 


AAY647 
34 


Homo sapiens 


GEST Human 5' EST related 
polypeptide SEQ ID NO:895. 


226 


100 


1562 


AAY814 
87 


Homo sapiens 


FUJY Human cathepsin L2. 


226 


100 


1563 


gil20527 
26 


Homo sapiens 


mRNA; cDNA 

DKFZp76IN0411 (From clone 
DKFZp761N0411); complete 
cds. 


2381 


99 


1563 


gil42509 
20 


Homo sapiens 


mRNA for SMC6 protein. 


2374 


99 


1563 


gil42509 
22 


Mus muscuius 


SMC6 protein 


2163 


88 


1564 


gil49705 


Homo sapiens 


mRNA for WDR9 protein 


524 


98 



203 
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SEQ 
ID 


Hit ID 


Spcicies 


Description 


S score 


Percent 
: identity 




62 




(WDR9 gene), form A. 




1564 


gil 49705 
91 


Mus musculus 


WDR9 protein, form A 


363 I 79 


1564 


AAB344 
87 


Homo sapiens 


HUMA- Human secreted protein 
BLAST search protein SEQ ID 
NO; 105. 


169 


IP** 

0 


1565 


gi680832 
9 


Homo sapiens 


mRNA; cDNA 

DKFZp434KO410 (from clone 
DKFZp434K0410); partial cds. 


787 


100 


1565 


AAB931 
88 


Homo sapiens 


HELI- Human protein sequence 
SEQIDNO:12140. 


528 


51 


1565 


AAB927 
02 


Homo sapiens 


HELI- Human protein sequence 
SEQIDNO:11102. 


528 


51 


1566 


gi!28038 
41 


Homo sapiens 


Similar to retinoic acid induced 
12, clone MGC:3 373 
IMAGE:3629369, mRNA, 
complete cds. 


1576 


99 


1566 


AAB267 
93 


Homo sapiens 


UYFU- Human melanoma 
growth related factor- 1 amino 
acid sequence. 


1199 


95 


1566 


gi!32777 
02 


Mus musculus 


retinoic acid induced 12 


1 I £9 
I lo£. 


/J 


1567 


AAB534 
05 


Homo sapiens 


HUMA- Human colon cancer 
antigen protein sequence SEQ 
IDNO:945. 


1288 


93 


1567 


gi29587 


Homo sapiens 


Human rn RN A for carbon tc 
anhydrase 11 (EC 4.2. LI). 


i zoo 




1567 


gil 79795 


Homo sapiens 


Human carbonic anhydrase 11 
mRNA, complete cds. 


1288 


93 


1568 


gi!82146 


Homo sapiens 


eosinophil peroxidase (EPP) 
gene, exon 12 and complete cds. 


V757 


inn 


1568 


gi31183 


Homo sapiens 


Human mRNA for eosinophil 
peroxidase. 




07 


1568 


gil 77737 
8 


Mus musculus 


eosinophil peroxidase 


3376 


89 


1569 


AAB941 
83 


Homo sapiens 


HELI- Human protein sequence 
SEQ ID NO: 14500, 


2405 


inn 


1569 


gil 04342 
15 


Homo sapiens 


cDNA FLJI2618fis, clone 
NT2RM4001666, weakly similar 
to HYPOTHETICAL 48.6 KD 
PROTEIN IN ALPA-GABP 
INTERGENIC REGION, 


2405 


100 


1569 


gil35439 
55 


Homo sapiens 


Similar to hypothetical protein 
FLJ12618, clone MGC:12994 
IMAGE:3504996, mRNA, 
complete cds. 


2082 


94 


1570 


AAB939 
04 


Homo sapiens 


HELI- Human proteih sequence 
SEQ IDNO:13862. 


2499 


99 


1570 


gil 42862 
14 


Homo sapiens 


hypothetical protein FLJ12150, 
clone MGC:15043 
IMAGE:3634992, mRNA, 
complete cds. 


2499 


99 


1570 


gil 04335 
59 


Homo sapiens 


CDNAFLJ12150 fis, clone 
MAMMA 1 000422. 


2499 


99 


1571 


gi768460 
5 


Mus musculus 


smoothelin B 


640 


54 


1571 


gi768460 


Mus musculus 


smoothelin A 640 


54 
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ID 


Hit ID 


Speicies 


Description 


S score 


Percent 
identity 




4 










1571 


gi754725 
8 


Mus muscuius 


smoothclin small isoform ^1 


640 


54 


1572 


AAB950 
33 


Homo sapiens 


HELI- Human protein sequjmce 
SEQ ID NO: 1 6784. f 


1174 


100 


1572 


gi 104334 
42 


Homo sapiens 


cDNA FLJ12056 fis, clone 0 
HEMBB1002050. 


1174 


100 


1572 


gi530876 


Chlamydomonas 
reinhardtii 


amino acid feature: Rod protein 
domain, aa 266 468; amino 
acid feature: globular protein 
domain, aa 32 „ 265 


142 


26 


1573 


gil28572 
47 


Mus musculus 


putative 


2111 


92 


1573 


gil07279 
09 


Drosophila 
melanogaster 


C06169 gene product 


688 


48 


1573 


gi23S891 
1 


Schizosaccharom 
yces pombe 


hypothetical PSUl-Hke protein 


585 


47 


1574 


AAB4I9 
54 


Homo sapiens 


CURA- Human ORFX 
ORF1718 polypeptide sequence 
SEQ IDNO:3436. 


1765 


97 


1574 


AAB427 
73 


Homo sapiens 


CURA- Human ORFX 
ORF2537 polypeptide sequence 
SEQ 1DNO:5074. 


1134 


93 


1574 


gil75I22 
54 


Homo sapiens 


hypothetical protein FLJ21 156, 
clone MGC:29459 
IMAGE:5020837, mRNA, 
complete cds. 


1089 


100 


1575 


AAY400 
90 


Homo sapiens 


HUM A- Peptide sequence 
derived from a human secreted 
protein. 


918 


98 


1575 


gil28546 
39 


Mus musculus 


putative 


443 


69 


1575 


gi!70661 
07 


Homo sapiens 


partial TTN gene for titin. 


86 


25 


1576 


gi 104384 
73 


Homo sapiens 


cDNA: FLJ22I84 fis, clone 
HRC00983. 


3291 


99 


1576 


gil02417 
12 


Homo sapiens 


mRNA; cDNA 

DKFZp761K08J6 (from clone 
DKFZp76lK08l6). 


1238 


99 


1576 


gi600118 


Zea mays 


extensin-like protein 


666 


33 


1577 


gi 165492 
61 


Homo sapiens 


cDNA FLJ30107fis, clone 
BNGH41000198, weakly similar 
to TETRACYCLINE 
RESISTANCE PROTEIN, 
CLASS E. 


1453 


100 


1577 


gil47150 
55 


Homo sapiens 


Similar to RIKEN cDNA 
1110002C08gene, clone 
MGC:9564 IMAGE:3S72267 } 
mRNA, complete cds, 


1453 


100 


1577 


gil28338 
45 


Mus musculus 


putative 

* 


1339 


90 


1578 


gil04376 
69 


Homo sapiens 


cDNA: FLJ21551 fis, clone 
COL06266. 


1925 


99 


1578 


AAE0179 
1 


Homo sapiens 


I1UMA- Human gene 22 
encoded secreted protein 
HOHDF66, SEQ ID NO:l 12, 


1840 


99 


1578 


AAB417 


Homo sapiens 


CURA- Human ORFX 


1473 


99 



205 



